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THE HELIOS GENE 



Background of the Invention 

The invention relates to the Helios gene, Helios polypeptide, Helios homodimers, 
Helios/Ikaros heterodimers, Helios/ Aiolos heterodimers and methods of using Helios 
nucleic acids and polypeptides. 

Summary of the Invention 

In general, the invention features an Helios polypeptide, e.g., a polypeptide which 
includes all or part of the sequence shown in SEQ ID NO:2, SEQ ID NO;4, or SEQ ID 
NO:6. The invention also features fragments and analogs of Helios polypeptides, preferably 
having at least one biological activity of an Helios polypeptide. 

In preferred embodiments, the polypeptide is a recombinant or a substantially pure 
preparation of an Helios polypeptide. 

In preferred embodiments, the polypeptide is a vertebrate, e.g., a mammalian, e.g., a 
human polypeptide. 

In preferred embodiments, the Helios polypeptide includes additional Helios coding 
sequences 5 f to that of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. 

In preferred embodiments: the polypeptide has at least one biological activity, e.g., it 

reacts with an antibody, or antibody fragment, specific for an Helios polypeptide; the 
polypeptide includes an amino acid sequence at least 60%, 74%, 80%, 90%, 95%, 98%, or 
99% homologous to an amino acid sequence from SEQ ID NO:2, SEQ ID NO:4, or SEQ ID 
NO:6; the polypeptide includes an amino acid sequence essentially the same as an amino 
acid sequence in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6; the polypeptide is at least 
5, 10, 20, 50, 100, 150, 200, or 250 amino acids in length; the polypeptide includes at least 
5, preferably at least 10, more preferably at least 20, most preferably at least 50, 100, 150, 
200, or 250 contiguous amino acids from SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6; 
the polypeptide is preferably at least 10, but no more than 100, amino acids in length; the 
Helios polypeptide is either, an agonist or an antagonist, of a biological activity of a 
naturally occurring Helios polypeptide. 



In preferred embodiments: the Helios polypeptide is encoded by the nucleic acid 
sequence of SEQ ID NO:l, SEQ ID NO:3, or SEQ ID NO:6, or by a nucleic acid having at 
least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with the nucleic acid of SEQ ID 
NO:l, SEQ ID NO:3, or SEQ ID NO:6. For example, the Helios polypeptide can be 
encoded by a nucleic acid sequence which differs from a nucleic acid sequence of SEQ ID 
NO:l, SEQ ID NO:3, or SEQ ID NO:6 due to degeneracy in the genetic code. 

In a preferred embodiment, the Helios polypeptide encodes amino acid residues 1- 
526 of SEQ ID NO:2, residues 1-500 of SEQ ID NO:4 or residues 1-526 of SEQ ID NO:6 
or a functionally equivalent residue in the Helios sequence of another vertebrate or 
mammal, e.g., a monkey. 

In a preferred embodiment the Helios polypeptide is an agonist of a naturally- 
occurring mutant or wild type Helios polypeptide (e.g., a polypeptide having an amino acid 
sequence shown in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6). In another preferred 

embodiment, the polypeptide is an antagonist which, for example, inhibits an undesired 

activity of a naturally-occurring Helios polypeptide (e.g., a mutant polypeptide). 

In a preferred embodiment, the Helios polypeptide differs in amino acid sequence at 
1, 2, 3, 5, 10 or more residues, but preferably less than 15, from a sequence in SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:6. The differences, however, are such that the Helios 
polypeptide exhibits at least one biological activity of an Helios polypeptide, e.g., the Helios 
polypeptide retains a biological activity of a naturally occurring Helios polypeptide. In 

other preferred embodiments, the Helios polypeptide differs at up to 1, 2, 3, 5, 10 amino 
acid residues from the sequence of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. 

In preferred embodiments the Helios polypeptide includes an Helios polypeptide 
sequence, as described herein, as well as other N-terminal and/or C-terminal amino acid 
sequences. 

In preferred embodiments, the polypeptide includes all or a fragment of an amino 
acid sequence from SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, fused, in reading 
frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 
5 1 to the genomic DNA which encodes a sequence from SEQ ID NO:2, SEQ ID NO:4, or 
SEQ ID NO:6. 
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In another aspect, the invention features a fragment of an Helios polypeptide. In one 
embodiment, the fragment is a terminal fragment, e.g., an N- or C- terminal deletion, e.g., a 
zinc finger, or an internal deletion, e.g., a zinc finger or a transcriptional activation domain. 
In another embodiment, the fragment includes one or more of: a N-terminal zinc finger, e.g., 
5 N-zinc finger 1 (ZF1), N-zinc finger 2 (ZF2), N-zinc finger 3 (ZF3), N-zinc finger 4 (ZF4), 
a transcriptional activation domain, or a C-terminal zinc finger, e.g., C-zinc finger 1 (ZF5), 
C-zinc finger 2 (ZF6). In another embodiment, the Helios polypeptide includes a deletion 

of one or more of the following: a N-terminal zinc finger, e.g., N-zinc finger 1 (ZF1), N- 
zinc finger 2 (ZF2), N-zinc finger 3 (ZF3), N-zinc finger 4 (ZF4), a transcriptional 
10 activation domain, or a C-terminal zinc finger, .e.g., a C-zinc finger 1 (ZF5) or a C-zinc 
finger 2 (ZF6). In another embodiment, the fragment is at least 20, 40, 60, or 80 amino 
acids in length. 

In yet other preferred embodiments, the Helios polypeptide is a recombinant fusion 
protein having a first Helios polypeptide portion and a second polypeptide portion having an 
15 amino acid sequence unrelated to an Helios polypeptide. The second polypeptide portion 
can be, e.g., any of glutathione-S-transferase, a DNA binding domain, or a polymerase 
activating domain. In preferred embodiment the fusion protein can be used in a two-hybrid 
assay. 

In a preferred embodiment, the Helios polypeptide is a fragment or analog of a 
20 naturally occurring Helios polypeptide which inhibits reactivity with antibodies, or F(ab f )2 

fragments, specific for a naturally occurring Helios polypeptide. 

In a preferred embodiment, the Helios polypeptide includes a sequence which is not 

present in the mature protein. 

Polypeptides of the invention include those which arise as a result of the existence of 
25 multiple genes, alternative transcription events, alternative RNA splicing events, and 
alternative translational and postranslational events. 

In preferred embodiments, the Helios polypeptide has one or more of the following 
properties: 

(a) it can form a dimer with an Helios, Aiolos, or Ikaros polypeptide; 
30 (b) it is expressed in hematopoietic stem cells; 

(c) it has a molecular weight of approximately 64 kDa or 68 KDa; 
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(d) it has at least one zinc finger domain; or 

(e) it is a transcriptional activator of a lymphoid gene. 

The invention includes an immunogen which includes an active or inactive Helios 

polypeptide, or an analog or a fragment thereof, in an immunogenic preparation, the 
5 immunogen being capable of eliciting an immune response specific for the Helios 
polypeptide, e.g., a humoral response, an antibody response, or a cellular response. In 
preferred embodiments, the immunogen comprising an antigenic determinant, e.g., a unique 

determinant, from a protein represented by SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. 
The invention also includes an antibody preparation, preferably a monoclonal 

10 antibody preparation, specifically reactive with an epitope of the Helios immunogen or 
generally of an Helios polypeptide. 

In another aspect, the invention provides a substantially pure nucleic acid having, or 
comprising, a nucleotide sequence which encodes a polypeptide, the amino acid sequence of 
which includes, or is, the sequence of an Helios polypeptide, or analog or fragment thereof. 

15 In preferred embodiments, the nucleic acid encodes a vertebrate, e.g., a mammalian, 

e.g., a human polypeptide. 

In preferred embodiments, the nucleic acid encodes an Helios polypeptide which 
includes additional Helios coding sequences 5' to that SEQ ID NO:2, 4, or 6. 

In preferred embodiments, the nucleic acid encodes a polypeptide having one or 

20 more of the following characteristics: at least one biological activity of an Helios, e.g., a 

polypeptide specifically reactive with an antibody, or antibody fragment, directed against an 
Helios polypeptide; an amino acid sequence at least 60%, 74%, 80%, 90%, 95%, 98%, or 
99% homologous to an amino acid sequence from SEQ ID NO:2, SEQ ID NO:4, or SEQ ID 
NO: 6; an amino acid sequence essentially the same as an amino acid sequence in SEQ ID 

25 NO:2, SEQ ID NO:4, or SEQ ID NO:6, the polypeptide is at least 5, 10, 20, 50, 100, 150, 
200, or 250 amino acids in length; at least 5, preferably at least 10, more preferably at least 
20, most preferably at least 50, 100, 150, 200, or 250 contiguous amino acids from SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:6; an amino acid sequence which is preferably at least 
10, but no more than 100, amino acids in length; the ability to act as an agonist or an 

30 antagonist of a biological activity of a naturally occurring Helios polypeptide. 



In preferred embodiments: the nucleic acid is or includes the nucleotide sequence of 
SEQ ID NO:l, SEQ ID NO:3, or SEQ ID NO:6; the nucleic acid is at least 60%, 70%, 74%, 

80%, 90%, 95%, 98%, or 99% homologous with a nucleic acid sequence of SEQ ID NO:l, 
SEQ ID NO:3, or SEQ ID NO:6; the nucleic acid includes a fragment of SEQ ID NO:l, 
SEQ ID NO:3, or SEQ ID NO:6 which is at least 25, 50, 100, 200, 300, 400, 500, or 1,000 
bases in length; the nucleic acid differs from the nucleotide sequence of SEQ ID NO:l due 
to degeneracy in the genetic code. 

In a preferred embodiment, the Helios encoding nucleic acid sequence encodes 
amino acid residues 1-526 of SEQ ID NO:2, residues 1-500 of SEQ ID NO:4, residues 1- 
526 of SEQ ID NO:6 or a functionally equivalent residue in the Helios sequence of another 
vertebrate or mammal, e.g., a monkey. 

In a preferred embodiment the polypeptide encoded by the nucleic acid is an agonist 
which, for example, is capable of enhancing an activity of a naturally-occurring mutant or 
wild type Helios polypeptide. In another preferred embodiment, the encoded polypeptide is 
an antagonist which, for example, inhibits an undesired activity of a naturally-occurring 
Helios polypeptide (e.g., a polypeptide having an amino acid sequence shown in SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:6). 

In a preferred embodiment, the encoded Helios polypeptide differs in amino acid 
sequence at 1, 2, 3, 5, 10 or more residues, but preferably less than 15, from a sequence in 
SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6. The differences, however, are such that 
the encoded Helios polypeptide exhibits at least one biological activity of a naturally 
occurring Helios polypeptide (e.g., the Helios polypeptide of SEQ ID NO:2, SEQ ID NO;4, 
or SEQ ID NO:6). 

In preferred embodiments, the nucleic acid encodes an Helios polypeptide which 
includes an Helios polypeptide sequence, as described herein, as well as other N-terminal 
and/or C-terminal amino acid sequences. 

In preferred embodiments, the nucleic acid encodes a polypeptide which includes all 
or a portion of an amino acid sequence shown in SEQ ID NO:2, SEQ ID NO:4, or SEQ ID 
NO:6, fused, in reading frame, to additional amino acid residues, preferably to residues 
encoded by genomic DNA 5 T to the genomic DNA which encodes a sequence from SEQ ID 
NO:2, SEQ ID NO:4, or SEQ ID NO:6. 
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In preferred embodiments, the encoded polypeptide is a recombinant fusion protein 
having a first Helios polypeptide portion and a second polypeptide portion having an amino 
acid sequence unrelated to an Helios polypeptide. The second polypeptide portion can be, 
e.g., any of glutathione-S-transferase; a DNA binding domain; or a polymerase activating 
5 domain. In preferred embodiments the fusion protein can be used in a two-hybrid assay. 

In preferred embodiments, the encoded polypeptide is a fragment or analog of a 
naturally occurring Helios polypeptide which inhibits reactivity with antibodies, or F(ab')2 

fragments, specific for a naturally occurring Helios polypeptide. 

In preferred embodiments, the nucleic acid will include a transcriptional regulatory 
10 sequence, e.g. at least one of a transcriptional promoter or transcriptional enhancer 

sequence, operably linked to the Helios gene sequence, e.g., to render the Helios gene 

sequence suitable for use as an expression vector. 

In yet another preferred embodiment, the nucleic acid of the invention hybridizes 

under stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive 
15 nucleotides from SEQ ID NO: 1 , SEQ ID NO:3, or SEQ ID NO:6, or more preferably to at 

least 20 consecutive nucleotides from SEQ ID NO:l, SEQ ID NO:3, or SEQ ID NO:6, or 

more preferably to at least 40 consecutive nucleotides from SEQ ID NO:l, SEQ ID NO:3, 

or SEQ ID NO:6. 

In a preferred embodiment, the nucleic acid encodes an Helios polypeptide which 
20 includes a sequence which is not present in the mature protein. 

In preferred embodiments, the nucleic acid encodes an Helios polypeptide which has 
one or more of the following properties: 

(a) it can form a dimer with an Helios, Aiolos, or Ikaros polypeptide; 

(b) it is expressed in hematopoietic stem cells; 

25 (c) it has a molecular weight of approximately 64 kDa or 68 KDa; 

(d) it has at least one zinc finger domain; or 

(e) it is a transcriptional activator of a lymphoid gene. 

30 In another aspect, the invention includes: a vector including a nucleic acid which 

encodes an Helios polypeptide; a host cell transfected with the vector; and a method of 



producing a recombinant Helios polypeptide, including culturing the cell, e.g., in a cell 
culture medium, and isolating the Helios polypeptide, e.g., an Helios polypeptide from the 
cell or from the cell culture medium. 

In another aspect, the invention features, a purified recombinant nucleic acid having 
at least 50%, 60%, 70%, 74%, 80%, 90%, 95%, 98%, or 99% homology with a nucleotide 
sequence shown in SEQ ID NO:l, SEQ ID NO:3, or SEQ ID NO:6. 

The invention also provides a probe or primer which includes or comprises a 
substantially purified oligonucleotide. The oligonucleotide includes a region of nucleotide 
sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides 
of sense or antisense sequence from SEQ ID NO:l, SEQ ID NO:3, or SEQ ID NO:6, or 
naturally occurring mutants thereof. In preferred embodiments, the probe or primer further 

includes a label group attached thereto. The label group can be, e.g., a radioisotope, a 

fluorescent compound, an enzyme, and/or an enzyme co-factor. Preferably the 
oligonucleotide is at least 10 and less than 20, 30, 50, 100, or 150 nucleotides in length. 

The invention involves nucleic acids, e.g., RNA or DNA, encoding a polypeptide of 
the invention. This includes double stranded nucleic acids as well as coding and antisense 
single strands. 

The invention includes vertebrate, e.g., mammalian, e.g., rodent, e.g., mouse or rat, 
or human Helios polypeptides. 

In another aspect, the invention features a method of evaluating a compound for the 
ability to interact with, e.g., bind, or modulate, e.g., inhibit or promote, the activity of an 
Helios polypeptide, e.g., an Helios monomer, or an Helios-Helios dimer, an Helios-Aiolos 
dimer, or an Helios-Ikaros dimer. The method includes contacting the compound with the 

Helios polypeptide, and evaluating the ability of the compound to interact with or form a 
complex with the Helios polypeptide. This method can be performed in vitro, e.g., in a cell 
free system, or in vivo, e.g., in a two-hybrid interaction trap assay. This method can be used 
to identify naturally occurring molecules which interact with the Helios polypeptide. It can 
also be used to find natural or synthetic inhibitors of mutant or wild type Helios 
polypeptide. The compound can be a peptide or a non peptide molecule, e.g., a small 
molecule preferably 500 to 5,000 molecular weight, more preferably 500 to 1,000 molecular 
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weight, having an aromatic scaffold, e.g., a bis-amide phenol, decorated with various 
functional groups. 

In brief, a two hybrid assay system (see e.g., Bartel et al. (1993) Cellular Interaction 
in Development: A practical Approach, D.A. Hartley, ed., Oxford University Press, 
5 Oxford, pp. 153-179) allows for detection of protein-protein interactions in yeast cells. The 
known protein, e.g., an Helios polypeptide, is often referred to as the "bait" protein. The 
proteins tested for binding to the bait protein are often referred to as "fish" proteins. The 
"bait 1 ' protein, e.g., an Helios polypeptide, is fused to the GAL4 DNA binding domain. 
Potential "fish" proteins are fused to the GAL4 activating domain. If the "bait" protein and 
10 a "fish" protein interact, the two GAL4 domains are brought into close proximity, thus 
rendering the host yeast cell capable of surviving a specific growth selection. 

In another aspect, the invention features a method of identifying active fragments or 
% analogs of an Helios polypeptide. The method includes first identifying a compound, e.g., 

jf^ an Ikaros peptide, which interacts with an Helios polypeptide and determining the ability of 

vp 15 the compound to bind the candidate fragment or analog. The two hybrid assay described 

* £ above can be used to obtain fragment-binding compounds. These compounds can then be 

^3 used as "bait" to fish for and identify fragments of the Helios polypeptide which interact, 

f 2 bind, or form a complex with these compounds. 

In another aspect, the invention features a method of making an Helios polypeptide, 

tfl 20 having a non-wild type activity, e.g., an antagonist, agonist, or super agonist of a naturally 

occurring Helios polypeptide. The method includes altering the sequence of an Helios 
polypeptide (e.g., SEQ ID NO:2, SEQ ID NO:4 5 or SEQ ID NO:6) by, for example, 
substitution or deletion of one or more residues of a non-conserved region, and testing the 
altered polypeptide for the desired activity. 

25 In another aspect, the invention features a method of making a fragment or analog of 

an Helios polypeptide, e.g., an Helios polypeptide having at least one biological activity of a 
naturally occurring Helios polypeptide. The method includes altering the sequence, e.g., by 
substitution or deletion of one or more residues, preferably which are non-conserved 
residues, of an Helios polypeptide, and testing the altered polypeptide for the desired 

30 activity. 
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In another aspect, the invention features, a method of evaluating a compound for the 
ability to bind a nucleic acid encoding an Helios gene regulatory sequence. The method 
includes: contacting the compound with the nucleic acid; and evaluating ability of the 
compound to form a complex with the nucleic acid. In preferred embodiments the Helios 
5 gene regulatory sequence is functionally linked to a heterologous gene, e.g., a reporter gene. 

In another aspect, the invention features a human cell, e.g., a hematopoietic stem cell 
or a lymphocyte e.g., a T or a B cell, transformed with a nucleic acid which encodes an 
Helios polypeptide. 

10 In another aspect, the invention features a method for treating an animal, e.g., a 

human, a mouse, a transgenic animal, or an animal model for a disorder, e.g., an immune 
system disorder, e.g., a T or B cell related disorder, e.g., a nude mouse or a SCID mouse, 
including administering a therapeutically-effective amount of an Helios polypeptide to the 
t'U animal. The Helios polypeptide can be monomeric or an Helios-Helios, an Helios-Aiolos 

In 1 5 dimer, or Helios-Ikaros dimer. 

In another aspect, the invention features a method for treating an animal, e.g., a 
: M human, a mouse, a transgenic animal, or an animal model for an immune system disorder, 

f2 e.g., a T or B cell related disorder, e.g., a nude mouse or a SCID mouse. The method 

^ includes administering to the animal a cell selected, e.g., selected in vitro, for the expression 

31 20 of a product of the Helios gene, e.g., hematopoietic stem cells, e.g., cells transformed with 

;S Helios-peptide-encoding DNA, e.g., hematopoietic stem cells transformed with Helios- 

peptide-encoding DNA. 

In preferred embodiments: the cells are taken from the animal to which they are 
administered; the cells are taken from an animal which is MHC matched with the animal to 
25 which they are administered; the cells are taken from an animal which is syngeneic with the 
animal to which they are administered; the cells are taken from an animal which is of the 
same species as is the animal to which they are administered. 

In another aspect, the invention features a method for treating an animal, e.g., a 
human, a mouse, a transgenic animal, or an animal model for an immune system disorder, 
30 e.g., a T or B cell related disorder, e.g., a nude mouse or a SCID mouse. The method 
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includes administering to the animal a nucleic acid encoding an Helios peptide and 

expressing the nucleic acid. 

In another aspect, the invention features a method of evaluating the effect of a 
treatment, e.g., a treatment designed to promote or inhibit hematopoiesis, including carrying 
5 out the treatment and evaluating the effect of the treatment on the expression of the Helios 
gene. 

In preferred embodiments the treatment is administered: to an animal, e.g., a human, 
a mouse, a transgenic animal, or an animal model for an immune system disorder, e.g., a T 
or B cell related disorder, e.g., a nude mouse or a SCID mouse, or a cell, e.g., a cultured 
10 stem cell 

In another aspect, the invention features a method for determining if a subject, e.g., a 
human, is at risk for a disorder related to mis-expression of the Helios gene or a disorder of 
the immune system, e.g., an immunodeficiency, or a T or B cell related disorder, e.g., a 
disorder characterized by a shortage of T or B cells. The method includes examining the 

15 subject for the expression of the Helios gene, non-wild type expression or mis-expression 
being indicative of risk. 

In another aspect, the invention features a method for determining if a subject, e.g., a 
human, is at risk for a disorder related to mis-expression of the Helios gene or a disorder of 
the immune system, e.g., an immunodeficiency, or a T or B cell related disorder, e.g., a 

20 disorder characterized by a shortage of T or B cells. The method includes providing a 
nucleic acid sample from the subject and determining if the structure of an Helios gene 
allele of the subject differs from wild type. 

In preferred embodiments: the determination includes determining if an Helios gene 
allele of the subject has a gross chromosomal rearrangement; the determination includes 

25 sequencing the subject's Helios gene. 

In another aspect, the invention features, a method of evaluating an animal or cell 
model for a a proliferative disorder, e.g., a leukemic disorder, Hodgkin's lymphoma, a 
cutaneuous cell lymphoma, e.g., a cutaneous T cell lymphoma, or an immune disorder, e.g., 
a T cell related disorder, e.g., a disorder characterized by a shortage of T or B cells. The 

30 method includes determining if the Helios gene in the animal or cell model is expressed at a 
predetermined level or if the Helios gene is mis-expressed. In preferred embodiments: the 
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predetermined level is lower than the level in a wild type or normal animal; the 

predetermined level is higher than the level in a wild type or normal animal; or the pattern 
of isoform expression is altered from wildtype. 

In another aspect, the invention features, a transgenic animal, e.g., a mammal, e.g., a 
5 mouse or a nonhuman primate having an Helios transgene. 

In preferred embodiments the animal is a transgenic mouse having a mutated Helios 
transgene, the mutation occurring in, or altering, e.g., a domain of the Helios gene described 
herein. 

In preferred embodiments the transgenic animal, e.g., a transgenic mouse, is 
10 homozygous for null mutations, e.g., it is homozygous for a deletion of the C terminal end 
of the protein, at the Helios locus. 

%n In preferred embodiments the transgenic animal, e.g., a transgenic mouse, is 

I ji homozygous for null mutations, e.g., it is homozygous for a deletion of the C terminal end 

of the protein, at the Helios locus and includes a mutation at Ikaros or Aiolos, e.g., a 
m 15 dominant negative mutation at Ikaros or Aiolos. Preferably the Ikaros mutation is 

heterozygous. 

In other preferred embodiments the transgenic animal or cell: is heterozygous for an 
Helios transgene; homozygous for an Helios transgene; includes a first Helios transgene and 

j t '. P'ji,*. 

'l ■' i: 

iQ a second Helios transgene; includes an Helios transgene and a second transgene which is 

% ™ 20 other than an Helios transgene, e.g., an Ikaros or Aiolos transgene. 

In another aspect, the invention features a method for evaluating the effect of a 
treatment on a transgenic cell or animal having an Helios transgene, e.g., the effect of the 
treatment on the development of the immune system. The method includes administering 
the treatment to a cell or animal having an Helios transgene, and evaluating the effect of the 
25 treatment on the cell or animal. The effect can be, e.g., the effect of the treatment on: 

Helios or Ikaros expression or misexpression; the immune system or a component thereof; 
or the cell cycle. Immune system effects include e.g., T cell activation, T cell development, 
the ability to mount an immune response, the ability to give rise to a component of the 
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immune system, B cell development, NK cell development, or the ratios CD4~ i 7CD8 + , 

CD4+/CD8" and CD4-/CD8+ . 

In preferred embodiments the treatment can include: the administration of a drug, 
chemical, or other substance; the administration of ionizing radiation; the administration of 

5 an antibody, e.g., an antibody directed against a molecule or cell of the immune system; 
administration of a substance or other treatment which suppresses the immune system; or 
administration of a substance or other treatment which activates or boosts the function of 
the immune system; introduction of a nucleic acid, e.g., a nucleic acid which encodes or 
expresses a gene product, e.g., a component of the immune system; the introduction of a 

10 protein, e.g., a protein which is a component of the immune system. 

In another aspect, the invention features a method for evaluating the effect of a 
treatment on an immune system component. The method includes: (1) supplying a 
transgenic cell or animal having an Helios transgene; (2) supplying the immune system 
component; (3) administering the treatment; and (4) evaluating the effect of the treatment 

15 on the immune system component. 

In yet another aspect, the invention features a method for evaluating the interaction 
of a first immune system component with a second immune system component. The 
method includes: (1) supplying a transgenic cell or animal, e.g., a mammal, having an 
Helios transgene; (2) introducing the first and second immune system component into the 

20 transgenic cell or mammal; and (3) evaluating an interaction between the first and second 
immune system components. 

In another aspect, the invention features a method for evaluating the effect of a 

treatment on an immune system disorder, e.g., a neoplastic disorder, a leukemia or a 
lymphoma, a T cell related lymphoma, including: administering the treatment to a cell or 
25 animal having an Helios transgene, and evaluating the effect of the treatment on the cell or 
animal. The effect can be, e.g., the effect of the treatment on: Helios or Ikaros expression 
or misexpression; the immune system or a component thereof; or the cell cycle. Immune 
system effects include e.g., T cell activation, T cell development, the ability to mount an 
immune response, the ability to give rise to a component of the immune system, B cell 
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development, NK cell development, or the ratios CD4+/CD8+ CD4+/CD8" and CD4~ 
/CD8+ 

The inventors have also discovered that Ikaros and Helios can form dimers 
(heterodimers) with other polypeptides. E.g., an Ikaros polypeptide can form dimers not 
only with Ikaros polypeptides, but with other polypeptides which bind to its C terminal 
region, e.g, other polypeptides having Zinc-finger regions, e.g., Helios polypeptides. 
Similarly, an Helios polypeptide can form dimers not only with Helios polypeptides, but 
with other polypeptides which bind to its C terminal region, e.g, other polypeptides having 
Zinc-finger regions, e.g., Ikaros polypeptides. 

The invention also includes Ikaros-Helios or Aiolos/Helios dimers. The Ikaros 
member of the dimer can be any Ikaros polypeptide, e.g., any naturally occuring Ikaros or 
any Ikaros referred to in U.S.S.N.08/238,212, filed May 2, 1994, hereby incorporated by 
reference. The Aiolos member of the dimer can be any Aiolos polypeptide, e.g., any 
naturally occuring Aiolos or any Aiolos referred to in U.S.S.N.60/005,529 filed October 18, 
1995, hereby incorporated by reference. 

The invention also includes: a cell, e.g., a cultured cell or a stem cell, containing 
purified Ikaros- or Aiolos-protein-encoding-DNA and purified Helios-protein-encoding - 
DNA; a cell capable of expressing an Ikaros and an Helios protein; a cell capable of giving 
rise to a transgenic animal or to a homogeneous population of hemopoietic cells, e.g., 
lymphoid cells, e.g., T cells; an essentially homogeneous population of cells, each of which 
includes purified Ikaros- or Aiolos-protein-encoding-DNA and purified Helios-protein- 
encoding -DNA ; and a method for manufacture of a dimer of the invention including 
culturing a cell which includes a DNA, preferably a purified DNA, of the invention in a 
medium to express the peptides . 

The invention also includes: a preparation of cells, e.g., cultured cells or a stem cells, 
including a cell a containing purified Ikaros- or Aiolos-protein-encoding-DNA and a cell 
encoding purified Helios-protein-encoding -DNA. 

The invention also includes substantially pure preparation of an antibody, preferably 
a monoclonal antibody directed against an Ikaros-Helios dimer or an Aiolos-Helios dimer 
(which preferably does not bind to an Ikaros-Ikaros, Aiolos-Aiolos or Helios-Helios dimer); 
a therapeutic composition including an Ikaros-Helios dimer or an Aiolos-Helios dimer and a 
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pharmaceutically acceptable carrier; a therapeutic composition which includes a purified 

DNA of the invention and a pharmaceutical^ acceptable carrier. 

In another aspect, the invention features a method for treating an animal, e.g., a 

human, a mouse, a transgenic animal, or an animal model for an immune system disorder, 
5 e.g., a T or B cell related disorder, e.g., a nude mouse or a SCID mouse, including 

administering a therapeutically-effective amount of an Ikaros-Helios or an Aiolos-Helios 

dimer to the animal. 

In another aspect, the invention features a method for treating an animal, e.g., a 

human, a mouse, a transgenic animal, or an animal model for an immune system disorder, 
10 e.g., a T or B cell related disorder, e.g., a nude mouse or a SCID mouse including 

administering to the animal cells selected, e.g., selected in vitro, for the expression of a 

product of the Ikaros gene and of the Helios gene, e.g., hematopoietic stem cells, e.g., cells 

transformed with Ikaros- or Aiolos-peptide-encoding DNA and or Helios-peptide-encoding 

DNA, e.g., hematopoietic stem cells transformed with Ikaros or Aiolos and or Helios- 
is peptide-encoding DNA. The Ikaros Aiolos and Helios DNA can be present in the same or 

in different cells. 

In preferred embodiments: the cells are taken from the animal to which they are 
administered; the cells are taken from an animal which is MHC matched with the animal to 
which they are administered; the cells are taken from an animal which is syngeneic with the 
20 animal to which they are administered; the cells are taken from an animal which is of the 
same species as is the animal to which they are administered. 

In another aspect, the invention features a method for treating an animal, e.g., a 
human, a mouse, a transgenic animal, or an animal model for an immune system disorder, 
e.g., a T or B cell related disorder, e.g., a nude mouse or a SCID mouse, including 
25 administering to the animal a nucleic acid encoding an Ikaros peptide and a nucleic acid 
encoding an Helios peptide and expressing the nucleic acids. 

In another aspect, the invention features a method of evaluating the effect of a 
treatment, e.g., a treatment designed to promote or inhibit hematopoiesis, including carrying 
out the treatment and evaluating the effect of the treatment on the expression of the Ikaros 
30 and the Helios gene. 
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In preferred embodiments the treatment is administered: to an animal, e.g., a human, 
a mouse, a transgenic animal, or an animal model for an immune system disorder, e.g., a T 
or B cell related disorder, e.g., a nude mouse or a SCID mouse, or a cell, e.g., a cultured 
stem cell. 

In another aspect, the invention features a method for determining if a subject, e.g., a 
human, is at risk for a disorder related to mis-expression of the Ikaros gene, e.g., a leukemic 
disorder or other disorder of the immune system, e.g., an immunodeficiency, or a T or B cell 
related disorder, e.g., a disorder characterized by a shortage of T or B cells, including 
examining the subject for the expression of the Ikaros-Helios or Aiolos-Helios dimers, non- 
wild type expression or mis-expression being indicative of risk. 

In another aspect, the invention features, a method of evaluating an animal or cell 
model for an immune disorder, e.g., a T cell related disorder, e.g., a disorder characterized 

by a shortage of T or B cells, including determining if Ikaros-Helios or Aiolos-Helios 

dimers in the animal or cell model are expressed at a predetermined level. In preferred 
embodiments: the predetermined level is lower than the level in a wild type or normal 
animal; the predetermined level is higher than the level in a wild type or normal animal; or 
the pattern of isoform expression is altered from wildtype. 

In another aspect, the invention features a transgenic rodent, e.g., a mouse, having a 
transgene which includes an Ikaros or Aiolos gene or Ikaros or Aiolos protein encoding 
DNA and an Helios gene or Helios protein encoding DNA. In preferred embodiments: the 
Ikaros, Aiolos and or Helios gene or DNA includes a deletion, e.g. a deletion of all or part 
of one or more exons. 

In another aspect, the invention features, a method of culturing an Helios- 
misexpressing cell having at least one mutant allele at the Helios locus. The cell can be, 
e.g., a hematopoietic cell, e.g., a T lymphocyte. The method includes: introducing the cell 
into a mammal, wherein, preferably, the mammal is other than the one from which the cell 
has been isolated originally; and culturing the cell. 

In a preferred embodiment, the method further includes: allowing the cell to 
proliferate in the mammal. 

In preferred embodiments: the mammal is a non-human mammal, e.g., a swine, a 
nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse. 
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In a preferred embodiment, the method further includes: allowing the Helios- 
misexpressing cell cell to divide and give rise to a proliferation-deregulated cell , e.g., a 
transformed lymphocyte; providing a plurality of the proliferation-deregulated cells e.g., 
lymphocytes or transformed lymphocytes from the mammal. 

In preferred embodiments: the mammal, the cell or both, are heterozygous at the 
Helios locus; the mammal, the cell or both, carry a mutation at the Helios gene, e.g., a point 
mutation in or a deletion for all or part of the Helios gene, e.g., a mutation in the DNA 
binding region, e.g., a point mutation in, or a deletion for all or part of one or more of the 
four N-terminal zinc finger regions which mediates DNA binding of the Helios protein or 
for one or more of the two C terminal zinc finger regions which mediate dimerization of the 
Helios protein; the mammal is heterozygous or homozygous for an Helios transgene;the 
mammal, the cell or both, carry a mutation in the control region of the Helios gene. 

In preferred embodiments: the mammal, the cell or both, carry a mutation at the 
Helios gene, e.g., a point mutation or a deletion, which, inactivates one or both of 

transcriptional activation or dimerization, which decreases the half life of the protein, or 
which inactivates one or both of the C terminal Zinc finger domains. 

In preferred embodiments the cell, e.g., a cell, is homozygous for null mutations, 
e.g., it is homozygous for a deletion of the C terminal end of the protein, at the Helios locus. 

In preferred embodiments the cell, e.g., a mouse cell, is homozygous for null 
mutations, e.g., it is homozygous for a deletion of the C terminal end of the protein, at the 
Helios locus and includes a mutation at Ikaros, e.g., a dominant negative mutation at Ikaros. 
Preferably the Ikaros mutaion is heterozygous. 

In preferred embodiments: the Helios-misexpressing cell is a homozygous mutant 
Helios cell e.g., a lymphocyte; the Helios-misexpressing cell is a B lymphocyte; the Helios- 
misexpressing cell is heterozygous or homozygous for an Helios transgene. 

In preferred embodiments, the Helios-misexpressing cell is a lymphocyte and is: a 
cell which secretes one or more anti-inflammatory cytokines; a cell which is antigen or 
idiotype specific. 

In preferred embodiments: the mammal is immunized with an antigen; the cell is 
exogenously supplied and one or both of the mammal or the mammal which donates the cell 
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are immunized with an antigen. The antigen can be: an alloantigen; a xenoantigen; an 
autoantigen; a protein; or an antigen which gives rise to an anti-idiotypic lymphocyte. 

In a preferred embodiment: the Helios-misexpressing cell, e.g., a lymphocyte, is 
supplied exogenously to the mammal, e.g., to a homozygous wild-type Helios mammal or a 
mammal carrying a mutation at the Helios gene, e.g., a point mutation or a deletion for all or 
part of the Helios gene. If exogenously supplied, the cell can be a human or a nonhuman, 
e.g., a swine, nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse, 
lymphocyte. 

Helios wild type cells can be cultured in Helios misexppressing mammals. 

In another aspect, the invention features, a method of modulating the activity of, or 

* 

promoting the interaction of an Helios misexpressing cell with, a target tissue or cell. The 

method includes: supplying the target; and exposing the target to a Helios misexpressing 

cell, e.g., a hematopoietic cell, e.g., a T lymphocyte, preferably having at least one mutant 

allele at the Helios locus, preferably provided that: the target is not Helios-misexpressing; 
the target and the cell differ in genotype at a locus other than the Helios locus; the target and 
the cell are from different animals; the target and the cell are from different species; the 
target activity is modulated in a recipient mammal and either the target or the cell is from a 

donor mammal other than the recipient mammal; or the target is exposed to the cell in an in 
vitro system. 

In a preferred embodiment: the donor of the Helios-misexpressing cell is 
heterozygous or homozygous for an Helios transgene; the donor of theHelios-misexpressing 
cell is heterozygous at the Helios locus; the donor of the Helios-misexpressing cell carries a 
point mutation in or a deletion for all or part of the Helios gene, e.g., mutation in the DNA 
binding region, e.g., a point mutation in, or a deletion for all or part of one or more of the 
four N-terminal zinc finger regions which mediate Helios binding to DNA or in one or both 
of the C-terminal zinc finger regions which mediates Helios dimerization; the donor of the 
Helios-misexpressing cell is human or a non-human mammal, e.g., a swine, a monkey, a 
goat, or a rodent, e.g., a rat or a mouse. In preferred embodiments, e.g., in the case of the 
human donor, the manipulation that gives rise to Helios deregulation, e.g., an Helios lesion, 
can be made in vitro. 
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In preferred embodiments: the mammal which provides the Helios misexpressing 
cell carries a mutation at the Helios gene, e.g., a point mutation or a deletion, which, 
inactivates one or both of transcriptional activation or dimerization, which decreases the 
half life of the protein, or which inactivates one or both of the C terminal Zinc finger 
5 domains. 

In another preferred embodiment: the cell is heterozygous or homozygous for an 
Helios transgene; the cell is a heterozygous Helios cell; the cell is a homozygous mutant 
Helios cell; the lymphocyte is a T lymphocyte. 

In preferred embodiments, the cell is a lymphocyte and is: a T cell; a cell which 
10 secretes one or more anti-inflammatory cytokines; a T cell which is antigen or idiotype 
specific. 

In a preferred embodiment: the method is performed in an in vitro system; the 

method is performed in vivo, e.g., in a mammal, e.g., a rodent, e.g., a mouse or a rat, or a 

primate, e.g., a non-human primate or a human. If the method is performed in vitro, the 
15 donor of the target cell or tissue and the lymphocyte can be same or different. If the method 

is performed in vivo, there is a recipient animal and one or more donors. 

In preferred embodiments: the method is performed in vivo and one or more of the 

recipient, the donor of the target cell or tissue, the donor of the cell, is immunized with an 

antigen; the method is performed in vitro and one or more of the donor of the target cell or 
20 tissue, the donor of the cell is immunized with an antigen. The antigen can be: an 

alloantigen; a xenoantigen or an autoantigen; a protein; or an antigen which gives rise to an 

anti-idiotypic lymphocyte. 

In a preferred embodiment: the target is selected from a group consisting of T or B 

lymphocytes, macrophages, inflammatory leukocytes, e.g., neutrophils or eosinophils, 
25 mononuclear phagocytes, NK cells or T lymphocytes; the target is an antigen presenting 

cell, e.g., a professional antigen presenting cell or a nonprofessional antigen presenting cell; 

the target is spleen tissue, bone marrow tissue, lymph node tissue or thymic tissue, or the 

target is a syngeneic, allogeneic, or xenogeneic tissue. 

In another preferred embodiment, the target is from a mammal, e.g., a human; the 
30 mammal is a non-human mammal, e.g., a swine, a monkey, a goat, or a rodent, e.g., a rat or 

a mouse. 
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In preferred embodiments, the activity of the target which is modulated is: the 
production of a cytokine; the proliferation or activation of a cell of the immune system; the 
production of an antibody; the lysis of an antigen presenting cell or the activation of a 
cytolytic T lymphocyte; the effect of target on resistance to infection; the effect of target on 
life span; the effect of target on body weight; the effect of target on the presence, function, 
or morphology of tissues or organs of the immune system; the effect of target on the ability 
of a component of the immune system to respond to a stimulus (e.g., a diffusable substance, 
e.g., cytokines, other cells of the immune system, or antigens); the effect of target on the 
ability to exhibit immunological tolerance to an alloantigen or a xenoantigen. 

In preferred embodiments the interaction is the binding of an antibody produced by 
the Helios misexpressing cell with the target. 

In preferred embodiments: the target and the cell differ in genotype at a locus other 
than the Helios locus; the target and the cell are from different animals; the target is not 
Helios-misexpressing. 

In another aspect, the invention features, a method of reconstituting an immune 
system. The method includes: supplying a recipient mammal, and introducing, preferably 
exogenously, into the recipient mammal, an immune system component from a donor 
mammal, which is Helios misexpressing, e.g., which carries at least one mutant allele at the 
Helios locus. The recipient mammal, can be, e.g., a human or a nonhuman mammal, e.g., a 

swine, a nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse. The 
donor mammal can be, e.g., a human or a nonhuman mammal, e.g., a swine, a monkey, a 
goat, or a rodent, e.g., a rat or a mouse. If the donor mammal is human, the manipulation 
that gives rise to Helios misexpression e.g., an the introduction of an Helios lesion, can be 
made in vitro. The donor mammal and the recipient mammal can be different individuals or 
the same individual. 

In preferred embodiments, the component is or includes an Helios misexpressing 
cell, e.g., a hematopoietic cell, e.g., a pluripotent stem cell, or a descendent of a stem cell, 
e.g., a lymphocyte. 
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In preferred embodiments, the component is from a donor mammal, e.g., a human or 
a nonhuman mammal, e.g., a swine, a monkey, a goat, or a rodent, e.g., a rat or a mouse. 

In a preferred embodiment, the method further includes: prior to introduction of a 
component into the subject, treating the lymphocyte to inhibit proliferation, e.g., by 

irradiating the component. 

In a preferred embodiment, the donor mammal carries a mutation at the Helios gene, 
e.g., a deletion of all or part of the Helios gene. 

In another preferred embodiment: the immune system component is any of a T cell, 
a T cell progenitor, a totipotent hematopoietic stem cell, a pluripotent hematopoietic stem 
cell, a B cell progenitor, a natural killer cell, a natural killer cell progenitor, bone marrow 
tissue, spleen tissue, or thymic tissue. 

In a preferred embodiment: the immune system component is from the same species 
as the recipient mammal; the immune system component is from species different from the 
species of the recipient mammal 

In preferred embodiments: the recipient mammal is a wild-type animal; an animal 
model for a human disease, e.g., a NOD mouse; the animal is immunocompromised by 
irradiation, chemotherapy, or genetic defect, e.g., the animal is a SCE) mouse or a nude 
mouse; the recipient is deficient in an immune function, e.g., the recipient has been 
thymectomized, depleted of an immune system component, e.g., of cells or antibodies; the 
recipient has been administered chemotherapy or irradiation. 

In preferred embodiments: the immune system component is heterozygous at the 
Helios locus; the immune system component is carries a mutation at the Helios gene, e.g., a 
point mutation in or a deletion for all or part of the Helios gene, e.g., a mutation in the DNA 
binding region, e.g., a point mutation in, or a deletion for all or part of one or more of the 
four N-terminal zinc finger regions which mediates DNA binding of the Helios protein or 
for one or more of the two C terminal zinc finger regions which mediate dimerization of the 
Helios protein;the immune system component is heterozygous or homozygous for an Helios 
transgene; the immune system component carries a mutation in the control region of the 
Helios gene. 

In preferred embodiments: the immune system component carries a mutation at the 
Helios gene, e.g., a point mutation or a deletion, which, inactivates one or both of 
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transcriptional activation or dimerization, which decreases the half life of the protein, or 
which inactivates one or both of the C terminal Zinc finger domains. 

In preferred embodiments: the method is performed in vivo, and the recipient 
mammal or the donor mammal or both are immunized with an antigen. The antigen can be: 
an alloantigen; a xenoantigen or an autoantigen; a protein; or an antigen which gives rise to 
an anti-idiotypic lymphocyte. 

In a preferred embodiment, the method further includes: determining a value for a 
parameter related to immune system function. The parameter related to the immune system 
function can be any of: the production of a cytokine; the proliferation or activation of a cell 
of the immune system; the production of an antibody; the lysis of an antigen presenting cell 
or the activation of a cytolytic T lymphocyte; resistance to infection; life span; body weight; 
the presence, function, or morphology of tissues or organs of the immune system; the ability 
of a component of the immune system to respond to a stimulus (e.g., a diffusable substance, 
e.g., cytokines, other cells of the immune system, or antigens); the ability to present an 
antigen; the ability to exhibit immunological tolerance to an alloantigen or a xenoantigen. 

In another aspect, the invention features, a method of evaluating the interaction of an 
Helios misexpressing cell, e.g., a hematopoietic cell, a T lymphocyte, with an immune 
system component. The method includes: supplying an animal, e.g., a swine, a nonhuman 
primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse; introducing the cell and 
the immune component into the animal; and evaluating the interaction between the Helios 
misexpressing cell and the immune system component. 

In a preferred embodiment, the method further includes: prior to introduction of a 
cell into the subject, treating the lymphocyte to inhibit proliferation, e.g., by irradiating the 
cell. 

In a preferred embodiment: the immune system component is any of a T cell, a T 
cell progenitor, a totipotent hematopoietic stem cell, a pluripotent hematopoietic stem cell, a 
B cell, a B cell progenitor, a natural killer cell, a natural killer cell progenitor, bone marrow 
tissue, spleen tissue, or thymic tissue; the immune system component is from the same 
species as the animal; the immune system component is from species different from the 
species of the animal; the immune system component is from the same species as the 



lymphocyte; the immune system component is from species different from the species from 
which the lymphocyte is obtained. 

In another preferred embodiment: the cell is from the same species as the animal; the 
cell is from a species which is different from the species of the animal. 
5 In another preferred embodiment: the recipient mammal is a wild-type animal; an 

animal model for a human disease, e.g., a NOD mouse; the animal is immunocompromised 
by irradiation, chemotherapy, or genetic defect, e.g., the animal is a SCID mouse or a nude 
mouse; the recipient is deficient in an immune function, e.g., the recipient has been 
thymectomized, depleted of an immune system component, e.g., of cells or antibodies; the 

10 recipient has been administered chemotherapy or irradiation. 

In a preferred embodiment: the cell is heterozygous or homozygous for an Helios 
transgene. 

In preferred embodiments evaluating can include evaluating any of: the production 
of a cytokine; the proliferation or activation of a cell of the immune system; the production 

15 of an antibody; the lysis of an antigen presenting cell or the activation of a cytolytic T 
lymphocyte; resistance to infection; life span; body weight; the presence, function, or 
morphology of tissues or organs of the immune system; the ability of a component of the 
immune system to respond to a stimulus (e.g., a diffusable substance, e.g., cytokines, other 
cells of the immune system, or antigens); the ability to present an antigen; the ability to 

20 exhibit immunological tolerance to an alloantigen or a xenoantigen. 

In preferred embodiments: the method is performed in vivo, and one or more of the 
animal, the donor of the Helios misexpressing cell, the donor of the immune system 
component, is immunized with an antigen. The antigen can be: an alloantigen; a 
xenoantigen or an autoantigen; a protein; or an antigen which gives rise to an anti-idiotypic 

25 lymphocyte. 

In another aspect, the invention features, a mammal, e.g., a nonhuman mammal, e.g., 
e.g., a swine, a nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse, 
having an exogenously introduced immune system component, the component being from a 
30 human or nonhuman mammal, e.g., a swine, a nonhuman primate, e.g., a monkey, a goat, or 
a rodent, e.g., a rat or a mouse, or cell culture which is Helios misexpressing or which 
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carries at least one mutant allele at the Helios locus. In preferred embodiments, e.g., if the 
immune system component is from a wild-type animal, e.g., a human, the manipulation that 
gives rise to Helios deregulation, e.g., an Helios lesion, can be made in vitro. 

In preferred embodiments, the component is from a human or nonhuman mammal, 
e.g., a swine, a nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse, 
which is Helios misexpressing. 

In preferred embodiments: the component is from a mammal which is Helios 
misexpressing; the component is from a mammal which is heterozygous at the Helios locus; 
the component is from a mammal which carries a mutation at the Helios gene, e.g., a point 
mutation in or a deletion for all or part of the Helios gene, e.g., a mutation in the DNA 
binding region, e.g., a point mutation in, or a deletion for all or part of one or more of the 
four N-terminal zinc finger regions which mediates DNA binding of the Helios protein or 
for one or more of the two C terminal zinc finger regions which mediate dimerization of the 
Helios protein; the component is from a mammal which is heterozygous or homozygous for 
an Helios transgene; the component is from a mammal which carries a mutation in the 

control region of the Helios gene. 

In preferred embodiments: the component is from a mammal which carries a 
mutation at the Helios gene, e.g., a point mutation or a deletion, which, inactivates one or 
both of transcriptional activation or dimerization, which decreases the half life of the 
protein, or which inactivates one or both of the C terminal Zinc finger domains. 

In preferred embodiments, the immune system component is: a helper T cell; 
cytolytic T cell; a suppressor T cell; a T cell which secretes one or more anti-inflammatory 
cytokines, e.g., IL-4, IL-10, or IL-13; a T cell which is antigen or idiotype specific; a 

suppressor T cell which is anti-idiotypic for an auto antibody or for an antibody which 
recognizes an allograft or xenograft tissue; the lymphocyte is an antigen-nonspecific T cell 
In another preferred embodiment: the immune system component is any of a T cell 
progenitor, a totipotent hematopoietic stem cell, a pluripotent hematopoietic stem cell, a B 
cell, a B cell progenitor, a natural killer cell, a natural killer cell progenitor, bone marrow 
tissue, spleen tissue, or thymic tissue; the immune system component is from the same 
species as the animal; the immune system component is from species different from the 
species of the animal. 
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In preferred embodiments; the mammal or the donor animal which produces the 
immune system component or both are immunized with an antigen. The antigen can be: an 
alloantigen; a xenoantigen or an autoantigen; a protein; or an antigen which gives rise to an 
anti-idiotypic lymphocyte. 

In another aspect, the invention features, a reaction mixture, preferably an in vitro 
reaction mixture, including an immune system component, the component including cells 
which misexpress Helios or being from an animal or cell culture which is misexpresses 
Helios or which carries at least one mutant allele at the Helios locus, and a target tissue or 
cell, wherein preferably, the immune system component and the target differ in genotype at 
a locus other than the Helios or Ikaros locus; the component and the target are from 
different species, or the component and the target are from different animals. 

In preferred embodiments, the component is from an animal or cell culture which 
misexpresses Helios. 

In preferred embodiments: the immune system component is a lymphocyte 
heterozygous or homozygous for an Helios transgene, e.g., a transgene having a point 
mutation or a deletion, which, inactivates one or both of transcriptional activation or 
dimerization, which decreases the half life of the protein, or which inactivates one or both of 
the C terminal Zinc finger domains; the immune system component is a lymphocyte 
heterozygous or homozygous for a C terminal deletion. 

In preferred embodiments, the immune system component is: a B cell. 

In another preferred embodiment: the immune system component is any of a T cell 

progenitor, a totipotent hematopoietic stem cell, a pluripotent hematopoietic stem cell, a B 

cell, a B cell progenitor, a natural killer cell, a natural killer cell progenitor, bone marrow 
tissue, spleen tissue, or thymic tissue; the immune system component is from the same 
species as the target cell; the immune system component is from species different from the 
species of the target cell. 

In a preferred embodiment: the target is selected from a group consisting of T or B 
lymphocytes, macrophages, inflammatory leukocytes, e.g., neutrophils or eosinophils, 
mononuclear phagocytes, NK cells or T lymphocytes; the target is an antigen presenting 
cell, e.g., a professional antigen presenting cell or a nonprofessional antigen presenting cell; 
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the target is spleen tissue, lymph node tissue, bone marrow tissue or thymic tissue, or is 
syngeneic, allogeneic, xenogeneic, or congenic tissue. 

In preferred embodiments: the donor of the immune system component or the donor 
of the target or both are immunized with an antigen. The antigen can be: an alloantigen; a 
xenoantigen or an autoantigen; a protein; or an antigen which gives rise to an anti-idiotypic 
lymphocyte. 

In preferred embodiments the donor of the components is: a human or nonhuman 
mammal, e.g., a swine, a nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat 
or mouse. In preferred embodiments, e.g., in the case of a wild-type donor, e.g., a human, 
the manipulation that gives rise to Helios deregulation, e.g., an Helios lesion, can be 
introduced in vitro. 

In preferred embodiments the donor of the target is: a human or nonhuman mammal, 
e.g., a swine, a nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or mouse. 

In preferred embodiments the reaction mixture includes an exogenously add 
cytokine or antigen, e.g., a protein antigen. 

In another aspect, the invention features a cell, or purified preparation of cells, 
which include an Helios transgene, or which otherwise misexpress an Helios gene. The cell 
preparation can consist of human or non human cells, e.g., rodent cells, e.g., mouse or rat 
cells, rabbit cells, or pig cells. In preferred embodiments, the cell or cells include an Helios 
transgene, e.g., a heterologous form of an Helios gene, e.g., a gene derived from humans (in 
the case of a non-human cell). The Helios transgene can be misexpressed, e.g., 
overexpressed or underexpressed. In other preferred embodiments, the cell or cells include 
a gene which misexpress an endogenous Helios gene, e.g., a gene the expression of which is 

disrupted, e.g., a knockout. Such cells can serve as a model for studying disorders which 
are related to mutated or mis-expressed Helios alleles or for use in drug screening. 

In another aspect, the invention features, a method of providing an antibody, e.g., a 
polyclonal or monoclonal antibody. The method includes: 

providing a mammal, e.g., a mouse, having a cell which is Helios deregulated, e.g., 
which misexpresses, preferably underexpresses, Helios, e.g., a hematopoietic cell; and 

isolating an antibody from the animal or from a cell derived from the animal, e.g., a 
hybridoma. 
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In preferred embodiments: the mammal is immunized with an antigen; the cell is 
exogenously supplied and one or both of the mammal, or the mammal which donates the 
cell, are immunized with an antigen. The antigen can be: an alloantigen; a xenoantigen; an 
autoantigen; a protein; or an antigen which gives rise to an anti-idiotypic lymphocyte. In 
5 preferred embodiments the antigen is an autoantigen and the animal is not immunized. 

In preferred embodiments: the mammal is a non-human mammal, e.g., a swine, a 
nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse. 

In a preferred embodiment, the method further includes: allowing the Helios- 
misexpressing cell to divide and give rise to a proliferation-deregulated or antibody 
10 producing cell, e.g., a lymphocyte. 

In preferred embodiments: the proliferation-deregulated or antibody producing cell 
e.g., a lymphocyte, e.g., a transformed lymphocyte, is isolated from a lymphoma of the 
mammal. 

In preferred embodiments: the mammal carries a mutation at the Helios gene, e.g., a 
15 point mutation in or a deletion for all or part of the Helios gene, e.g., a mutation in the DNA 
binding region, e.g., a point mutation in, or a deletion for all or part of one or more of the 
four N-terminal zinc finger regions which mediate DNA binding of the Helios protein or for 
one or more of the two C terminal zinc finger regions which mediate dimerization of the 

Helios protein; the mammal is heterozygous or homozygous for an Helios transgene; the 
20 mammal carries a mutation in the control region of the Helios gene. 

In preferred embodiments the mammal, e.g., a mouse, is homozygous for null 
mutations, e.g., it is homozygous for a deletion of the C terminal end of the protein, at the 
Helios locus. 

In preferred embodiments the mammal, e.g., a mouse, is homozygous for null 
25 mutations, e.g., it is homozygous for a deletion of the C terminal end of the protein, at the 
Helios locus and includes a mutation at Ikaros, e.g., a dominant negative mutation at Ikaros. 
Preferably the Ikaros mutaion is heterozygous. 

In preferred embodiments: the mammal carries homozygous mutations at the Helios 
gene, e.g., a point mutation or a deletion, which, inactivates one or both of transcriptional 
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activation or dimerization, which decreases the half life of the protein, or which inactivates 
one or both of the C terminal Zina finger domains. 

In preferred embodiments: the proliferation-deregulated or antibody producing cell 
is a homozygous mutant Helios cell e.g., a lymphocyte; the proliferation-deregulated or 
antibody producing lymphocyte is a B lymphocyte; the proliferation-deregulated or 
antibody producing cell is heterozygous or homozygous for an Helios transgene. 

In preferred embodiments, the cell is a lymphocyte and is: a cell which secretes one 
or more anti-inflammatory cytokines; a cell which is antigen or idiotype specific; a cell 
which produces, or over produces, antibodies, e.g., IgG, IgA, or IgE antibodies. 

In a preferred embodiment: the Helios-misexpressing cell, e.g., a lymphocyte, is 
supplied exogenously to the mammal, e.g., to a homozygous wild-type Helios mammal or a 
mammal carrying a mutation at the Helios gene, e.g., a point mutation or a deletion for all or 
part of the Helios gene. If exogenously supplied, the cell can be a human or a nonhuman, 
e.g., a swine, nonhuman primate, e.g., a monkey, a goat, or a rodent, e.g., a rat or a mouse, 
lymphocyte. The exogenously supplied cell can be homozygous for null mutations, e.g., 
homozygous for a deletion of the C terminal end of the protein, at the Helios locus. The 
exogenously supplied cell can be homozygous for null mutations, e.g., homozygous for a 
deletion of the C terminal end of the protein, at the Helios locus and include a mutation at 
Ikaros, e.g., a dominant negative mutation at Ikaros. Preferably the Ikaros mutaion is 
heterozygous. 

In a preferred embodiment the method further comprises isolating one or more cells, 
e.g., lymphocytes, from the mammal, and allowing the cell or cells to proliferate into a 
clonal population of cells, e.g., lymphocytes. 

In a preferred embodiment the method further comprises isolating one or more cells, 
e.g., lymphocytes, from the mammal, and allowing the cell or cells to proliferate into a 
clonal population of cells, e.g., lymphocytes, and isolating the antibody therefrom. 

In preferred embodiments a cell from the animal is fused with a second cell to 
provide a hybridoma. 

In preferred embodiments a cell from the animal is fused with a second cell to 
provide a hybridoma and the antibody is isolated from the hybridoma. 
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Cells, e.g., stem cells, treated by the method of the invention can be introduced into 
mammals, e.g., humans, non-human primates, or other mammals, e.g., rodents. In preferred 
embodiments the treatment is performed ex vivo and: the cell is autologous, e.g., it is 
returned to the same individual from which it was derived; the cell is allogeneic, i.e., it is 
from the same species as the mammal to which it is administered; the cell is xenogeneic, 
i.e., it is from a different species from the mammal to which it is administered. 

An Helios-deregulated cell is a cell which has a mutant or misexpressed Helios 
gene, e.g., an inactiviated Helios gene. 

A hematopoietic cell, can be, e.g., stem cell, e.g., a totipotent or a pluripotent stem 
cell, or a descendent of a stem cell, e.g., a lymphocyte, e.g. a B lymphocyte or a T 

lymphocyte. 

An Helios misexpressing animal, as used herein, is an animal in which one or more, 
and preferably substantially all, of the cells misexpress Helios. 

A mutation at the Helios locus, as used herein, includes any mutation which alters 
the expression, structure, or activity of the Helios gene or its gene product. These include 
point mutations in and in particular deletions of all or part of the Helios coding region or its 
control region. 

An exogenously supplied cell, tissue, or cell product, e.g., a cytokine, as used herein, 
is a cell, tissue, or a cell product which is derived from an animal other than the one to 
which is supplied or administered. It can be from the same species or from different species 
than the animal to which it is supplied. 

A substantially homogenous population of two or more cells e.g., lymphocytes, as 
used herein, means a population of cells in which at least 50% of the cells, more preferably 
at least 70% of the cells, more preferably at least 80% of the cells, most preferably at least 
90%, 95% or 99% of the subject cell type, e.g., lymphocytes. With respect to the Helios 
locus however, the cells can be all (+/-), all (-/-), or a mixture of (+/-) and (-/-) cells. 

Culturing, as used herein, means contacting a cell or tissue with an environment 
which will support viability of the cell or tissue and which preferably supports proliferation 
of the cell or tissue. 

A substantially purified preparation of cells, e.g., lymphocytes, as used herein, 
means a preparation of cells in which at least 50% of the cells, more preferably at least 70% 
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of the cells, more preferably at least 80% of the cells, most preferably at least 90%, 95% or 
99% of the cells of the subject cell, e.g., are lymphocytes. With respect to the Helios locus 
however, the cells can be all (+/-), all (-/-), or a mixture of (+/-) and (-/-) cells. 

Immunocompromised, as used herein, refers to a mammal in which at least one 
5 aspect of the immune system functions below the levels observed in a wild-type mammal. 
The mammal can be immunocompromised by a chemical treatment, by irradiation, or by a 
genetic lesion resulting in, e.g., a nude, a beige, a nude-beige, or an Ikaros - phenotype. The 
mammal can also be immunocompromised by an acquired disorder, e.g., by a virus, e.g., 
HIV. 

10 As used herein, an Helios transgene, is a transgene which includes all or part of an 

Helios coding sequence or regulatory sequence. The term also includes DNA sequences 
which when integrated into the genome disrupt or otherwise mutagenize the Helios locus. 
Helios transgenes sequences which when integrated result in a deletion of all or part of the 
Helios gene. Included are transgenes: which upon insertion result in the misexpression of 

15 an endogenous Helios gene; which upon insertion result in an additional copy of an Helios 
gene in the cell; which upon insertion place a non-Helios gene under the control of an 
Helios regulatory region. Also included are transgenes: which include a copy of the Helios 
gene having a mutation, e.g., a deletion or other mutation which results in misexpression of 
the transgene (as compared with wild type); which include a functional copy of an Helios 

20 gene (i.e., a sequence having at least 5% of a wild type activity, e.g., the ability to support 
the development of T, B, or NK cells); which include a functional (i.e., having at least 5% 
of a wild type activity, e.g., at least 5% of a wild type level of transcription) or 
nonfunctional (i.e., having less than 5% of a wild type activity, e.g., less than a 5% of a wild 
type level of transcription) Helios regulatory region which can (optionally) be operably 

25 linked to a nucleic acid sequence which encodes a wild type or mutant Helios gene product 
or, a gene product other than an Helios gene product, e.g., a reporter gene, a toxin gene, or a 
gene which is to be expressed in a tissue or at a developmental stage at which Helios is 
expressed. Preferably, the transgene includes at least 10, 20, 30, 40 , 50, 100, 200, 500, 
1,000, or 2,000 base pairs which have at least 50, 60, 70, 80, 90, 95, or 99 % homology with 

30 a naturally occurring Helios sequence. Preferably, the transgene includes a deletion of all or 
some of exons 3 and 4, or a deletion for some or all of exon 7 of the Helios gene. 
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A "heterologous promoter", as used herein is a promoter which is not naturally 

associated with the Helios gene. 

A "purified preparation" or a "substantially pure preparation" of an Helios 
polypeptide, or a fragment or analog thereof (or an Helios-Helios or Helios-Ikaros dimer), 
as used herein, means an Helios polypeptide, or a fragment or analog thereof (or an Helios- 
Helios or Helios-Ikaros dimer), which is free of one or more other proteins lipids, and 
nucleic acids with which the Helios polypeptide (or an Helios-Helios or Helios-Ikaros 
dimer) naturally occurs. Preferably, the polypeptide, or a fragment or analog thereof (or an 
Helios-Helios or Helios-Ikaros dimer), is also separated from substances which are used to 
purify it, e.g., antibodies or gel matrix, such as polyacrylamide. Preferably, the polypeptide, 
or a fragment or analog thereof (or an Helios-Helios or Helios-Ikaros dimer), constitutes at 
least 10, 20, 50 70, 80 or 95% dry weight of the purified preparation. Preferably, the 
preparation contains: sufficient polypeptide to allow protein sequencing; at least 1, 10, or 
100 g of the polypeptide; at least 1, 10, or 100 mg of the polypeptide. 

A "purified preparation of cells", as used herein, refers to, in the case of plant or 
animal cells, an in vitro preparation of cells and not an entire intact plant or animal. In the 
case of cultured cells or microbial cells, it consists of a preparation of at least 10% and more 
preferably 50% of the subject cells. 

A "treatment", as used herein, includes any therapeutic treatment, e.g., the 
administration of a therapeutic agent or substance, e.g., a drug. 

A "substantially pure nucleic acid", e.g., a substantially pure DNA encoding an 
Helios polypeptide, is a nucleic acid which is one or both of: not immediately contiguous 
with one or both of the coding sequences with which it is immediately contiguous (i.e., one 
at the 5' end and one at the 3' end) in the naturally-occurring genome of the organism from 
which the nucleic acid is derived; or which is substantially free of a nucleic acid sequence 
with which it occurs in the organism from which the nucleic acid is derived. The term 
includes, for example, a recombinant DNA which is incorporated into a vector, e.g., into an 
autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or 
eukaryote, or which exists as a separate molecule (e.g., a cDNA or a genomic DNA 
fragment produced by PCR or restriction endonuclease treatment) independent of other 
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DNA sequences. Substantially pure DNA also includes a recombinant DNA which is part 
of a hybrid gene encoding additional Helios sequences. 

"Homologous", as used herein, refers to the sequence similarity between two 
polypeptide molecules or between two nucleic acid molecules. When a position in both of 
the two compared sequences is occupied by the same base or amino acid monomer subunit, 
e.g., if a position in each of two DNA molecules is occupied by adenine, then the molecules 
are homologous at that position. The percent of homology between two sequences is a 
function of the number of matching or homologous positions shared by the two sequences 
divided by the number of positions compared x 100. For example, if 6 of 10, of the 
positions in two sequences are matched or homologous then the two sequences are 60% 
homologous. By way of example, the DNA sequences ATTGCC and TATGGC share 50% 
homology. Generally, a comparison is made when two sequences are aligned to give 
maximum homology. 

To determine the percent homology of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino acid or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in one sequence (SEQ ID NO:2) is occupied by the same amino 
acid residue or nucleotide as the corresponding position in theother sequence, then the 
molecules are homologous at that position (i.e., as used herein amino acid or nucleic acid 
"homology" is equivalent to amino acid or nucleic acid "identity"). The percent homology 
between the two sequences is a function of the number of identical positions shared by the 
sequences (i.e., % homology = # of identical positions/total # of positions x 100). For 
example, if 6 of 10, of the positions in two sequences are matched or homologous then the 
two sequences are 60% homologous or have 60% sequence identity. BY way of example, 
the DNA sequences ATTGCC and TATGGC share 50% homology or sequence identity. 
Generally, a comparison is made when two sequences are aligned to give e maximum 
homology or sequence identity. 

The comparison of sequences and determination of percent homology between two 
sequences can be accomplished using a mathematical algorithm. A preferred, non-limiting 
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example of a mathematical algorithm utilized for the comparison of two sequences is the 
algorithm of Karlin and Altschul (1990) Proc. Natl Acad. Sci. USA 87:2264-68, modified 
as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such an 
algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) of 

5 Altschul, et al (1990) J. Mol. Biol. 215:403-10. Blast nucleotide searches can be performed 
with the NBLAST program, score=100, wordlength-12 to obtain nucleotide sequences 
homologous to the nucleic acids of the invention. BLAST protein searches can be 
performed with the XBLAST program, score-50, wordlength=3 to obtain amino acid 
sequences homologous to protein molecules of the invention. To obtain gapped alignments 

10 for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al, 
(1997) Nucleic Acids Research 25(17):3389-3402. When utilizing BLAST and Gapped 
BLAST programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used. See http:Wwww.ncbi.nlm.nih.gov. Another preferred, non- limiting 
example of a mathematical algorithim utilized for the comparison of sequences is the 

15 algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the 
ALIGN program (version 2.0) which is part of the GCG sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, a 
PAM1 20 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be 
used. Programs which are equivalent in terms of the results they produce can be used. 

20 

The terms "peptides", "proteins", and "polypeptides" are used interchangeably 

herein. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, e.g., 
one or more Helios polypeptides or Helios-Ikaros dimers), which is partly or entirely 

25 heterologous, i.e., foreign, to the transgenic animal or cell into which it is introduced, or, is 
homologous to an endogenous gene of the transgenic animal or cell into which it is 
introduced, but which is designed to be inserted, or is inserted, into the animal's genome in 
such a way as to alter the genome of the cell into which it is inserted (e.g., it is inserted at a 
location which differs from that of the natural gene or its insertion results in a knockout). A 

30 transgene can include one or more transcriptional regulatory sequences and any other 

nucleic acid, such as introns, that may be necessary for optimal expression of the selected 
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nucleic acid, all operably linked to the selected nucleic acid, and may include an enhancer 
sequence. 

As used herein, the term "transgenic cell" refers to a cell containing a transgene. 

As used herein, a "transgenic animal" is any animal in which one or more, and 
preferably essentially all, of the cells of the animal includes a transgene. The transgene can 
be introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. This molecule may be integrated within a chromosome, or it may be 
extrachromosomally replicating DNA. 

As used herein, the term "tissue-specific promoter" means a DNA sequence that 
serves as a promoter, i.e., regulates expression of a selected DNA sequence, such as the 
Helios and/or Ikaros gene, operably linked to the promoter, and which effects expression of 
the selected DNA sequence in specific cells of a tissue, such as lymphocytes. The term also 
covers so-called "leaky" promoters, which regulate expression of a selected DNA primarily 
in one tissue, but cause expression in other tissues as well. 

A polypeptide has Helios biological activity if it has one or more of the following 
properties: (1) the ability to react with an antibody, or antibody fragment, specific for (a) a 
wild type Helios polypeptide, (b) a naturally-occurring mutant Helios polypeptide, or (c) a 
fragment of either (a) or (b); (2) the ability to form Helios dimers, Helios/ Aiolos, and/or 
Helios/Ikaros dimers; (3) the ability to modulate the development of hematopoietic stem 
cells; (4) the ability to stimulate transcription from a sequence; or (5) the ability to act as an 
antagonist or agonist of the activities recited in (1), (2), (3) or (4). 

"Misexpression", as used herein, refers to a non-wild type pattern of Helios gene 
expression. It includes: expression at non-wild type levels, i.e., over or under expression; a 
pattern of expression that differs from wild type in terms of the time or stage at which the 
gene is expressed, e.g., increased or decreased expression (as compared with wild type) at a 
predetermined developmental period or stage; a pattern of expression that differs from wild 
type in terms of decreased expression (as compared with wild type) in a predetermined cell 
type or tissue type; a pattern of expression that differs from wild type in terms of the 
splicing, size, amino acid sequence, post-transitional modification, stability, or biological 
activity of the expressed Helios and/or Ikaros polypeptide; a pattern of expression that 
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differs from wild type in terms of the effect of an environmental stimulus or extracellular 
stimulus on expression of the Helios and/or Ikaros gene, e.g., a pattern of increased or 
decreased expression (as compared with wild type) in the presence of an increase or 
decrease in the strength of the stimulus; a ratio of Ikaros-Ikaros dimer to Helios-Helios 

5 dimer which differs from wild type; a ratio of Helios to Helios-Helios dimer, Ikaros-Ikaros 
dimer, or Ikaros-Helios dimer that differs from wild type; a ratio of Ikaros-Helios dimer to 
Helios, Ikaros, Helios-Helios dimer, or Ikaros-Ikaros dimer that differs from wild type. 

As described herein, one aspect of the invention features a pure (or recombinant) 
nucleic acid which includes a nucleotide sequence encoding an Helios, and/or equivalents of 

10 such nucleic acids. The term "nucleic acid 11 , as used herein, can include fragments and 
equivalents. The term "equivalent" refers to nucleotide sequences encoding functionally 
equivalent polypeptides or functionally equivalent polypeptides which, for example, retain 
the ability to react with an antibody specific for an Helios polypeptide. Equivalent 
nucleotide sequences will include sequences that differ by one or more nucleotide 

15 substitutions, additions or deletions, such as allelic variants, and will, therefore, include 
sequences that differ from the nucleotide sequence of Helios shown in SEQ ID NO:l, SEQ 
ID NO:3, or SEQ ID NO:6 due to the degeneracy of the genetic code. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of cell biology, cell culture, molecular biology, transgenic biology, 

20 microbiology, recombinant DNA, and immunology, which are within the skill of the art. 
Such techniques are described in the literature. See, for example, Molecular Cloning A 
Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glover ed., 1985); 
Oligonucleotide Synthesis (M. J. Gait ed., 1984); Mullis et al. U.S. Patent No: 4,683,195; 

25 Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Transcription And 
Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells (R. I. 
Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. 
Perbal, A Practical Guide To Molecular Cloning (1984); the treatise, Methods In 
Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. 

30 H. Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); Methods In 

Enzymology, Vols. 154 and 155 (Wu et al. eds.), Immunochemical Methods In Cell And 
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Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Handbook Of 
Experimental Immunology, Volumes I-IV (D. M. Weir and C. C. Blackwell, eds., 1986); 
Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y., 1986). 

The Helios genes and polypeptides of the present invention are useful for studying, 
diagnosing and/or treating diseases associated with unwanted cell proliferation, e.g., 
leukemias or lymphomas. The gene (or fragment thereof) can be used to prepare antisense 
constructs capable of inhibiting expression of a mutant or wild type Helios gene encoding a 
polypeptide having an undesirable function. Alternatively, an Helios polypeptide can be 
used to raise antibodies capable of detecting proteins or protein levels associated with 
abnormal cell proliferation or lymphocyte differentiation, e.g., T cell maturation. 
Furthermore, Helios peptides, antibodies or nucleic acids, can be used to identify the stage 
of lymphocyte differentiation, e.g., the stage of T cell differntiation. 

Other features and advantages of the invention will be apparent from the following 
detailed description, and from the claims. 

Brief description of the drawings 

Figure 1 depicts an alignment of the predicted amino acid sequence of Helios with that of 
Ikaros (SEQ ID NO:7) and Aiolos (SEQ ID NO:8). The four N-terminal zinc fingers (ZF1- 
4) comprising the DNA binding domain, the C-terminal zinc fingers (ZF5-6) that mediate 
protein dimerization and the conserved transcriptional activation domain (TAD) are 
outlined. Arrows indicate the conserved sequences to which the degenerate oligos Ik-1 
(GEKPKF, Ik-F) and Ik-2 (YTIHMG, IK-R) were designed to clone the Helios gene. 

Figure 2 depicts a diagram of hemopoietic hierarchy of the progenitors and committed cells 
analyzed for Helios family gene expression. 

Figure 3 depicts the mouse Helios-1 nucleotide (SEQ ID NO:l) and amino acid (SEQ ID 
NO:2) sequences. 
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Figure 4 depicts the mouse Helios-2 nucleotide (SEQ ID NO:3) and amino acid (SEQ ID 
NOL 4) sequences. 

Figure 5 depicts the human Helios-2 nucleotide (SEQ ID NO:5) and amino acid (SEQ ID 
NO: 6) sequences. 

Figure 6 depicts an alignment of the nucleic acid sequence of mouse Helios with human 
Helios. 

Figure 7 depicts an alignment of the amino acid sequence of mouse Helios with human 
Helios. 

Detailed Description of the Invention 

Gene Therapy 

The gene constructs of the invention can also be used as a part of a gene therapy 

protocol to deliver nucleic acids encoding either an agonistic or antagonistic form of an 
Helios polypeptide. The invention features expression vectors for in vivo transfection and 
expression of an Helios polypeptide in particular cell types (e.g., dermal cells) so as to 
reconstitute the function of, enhance the function of, or alternatively, antagonize the 
function of an Helios polypeptide in a cell in which the polypeptide is expressed or 
misexpressed. 

Expression constructs of Helios polypeptide, may be administered in any 
biologically effective carrier, e.g. any formulation or composition capable of effectively 
delivering the Helios gene to cells in vivo. Approaches include insertion of the subject gene 
into viral vectors including recombinant retroviruses, adenovirus, adeno-associated virus, 
and herpes simplex virus- 1, or recombinant bacterial or eukaryotic plasmids. Viral vectors 
transfect cells directly; plasmid DNA can be delivered with the help of, for example, 
cationic liposomes (lipofectin) or derivatized (e.g. antibody conjugated), polylysine 
conjugates, gramacidin S, artificial viral envelopes or other such intracellular carriers, as 
well as direct injection of the gene construct or CaP04 precipitation carried out in vivo. 
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A preferred approach for in vivo introduction of nucleic acid into a cell is by use of a 
viral vector containing nucleic acid, e.g. a cDNA encoding an Helios polypeptide. Infection 
of cells with a viral vector has the advantage that a large proportion of the targeted cells can 
receive the nucleic acid. Additionally, molecules encoded within the viral vector, e.g., by a 

5 cDNA contained in the viral vector, are expressed efficiently in cells which have taken up 
viral vector nucleic acid. 

Retrovirus vectors and adeno-associated virus vectors can be used as a recombinant 
gene delivery system for the transfer of exogenous genes in vivo, particularly into humans. 
These vectors provide efficient delivery of genes into cells, and the transferred nucleic acids 

10 are stably integrated into the chromosomal DNA of the host. The development of 

specialized cell lines (termed "packaging cells") which produce only replication-defective 
retroviruses has increased the utility of retroviruses for gene therapy, and defective 
retroviruses are characterized for use in gene transfer for gene therapy purposes (for a 
review see Miller, A.D. (1990) Blood 76, 271). A replication defective retrovirus can be 

15 packaged into virions which can be used to infect a target cell through the use of a helper 
virus by standard techniques. Protocols for producing recombinant retroviruses and for 
infecting cells in vitro or in vivo with such viruses can be found in Current Protocols in 
Molecular Biology . Ausubel, F.M. et al. (eds.) Greene Publishing Associates, (1989), 
Sections 9.10-9.14 and other standard laboratory manuals. Examples of suitable 

20 retroviruses include pLJ, pZIP, pWE and pEM which are known to those skilled in the art. 
Examples of suitable packaging virus lines for preparing both ecotropic and amphotropic 
retroviral systems include vyCrip, i|/Cre, \j/2 and i|/Am. Retroviruses have been used to 
introduce a variety of genes into many different cell types, including epithelial cells, in vitro 
and/or in vivo (see for example Eglitis, et al. (1985) Science 230:1395-1398; Danos and 

25 Mulligan (1988) Proc. Natl Acad. Set USA 85:6460-6464; Wilson et al. (1988) Proc. Natl 
Acad. Set USA 85:3014-3018; Armentano et al. (1990) Proc. Natl. Acad. Set USA 87:6141- 
6145; Huber et al. (1991) Proc. Natl Acad. Sci. USA 88:8039-8043; Ferry et al. (1991) 
Proc. Natl Acad. Sci. USA 88:8377-8381; Chowdhury et al. (1991) Science 254:1802-1805; 
van Beusechem et al. (1992) Proc. Natl. Acad. Set USA 89:7640-7644; Kay et al. (1992) 

30 Human Gene Therapy 3:641-647; Dai et al. (1992) Proc. Natl Acad. Set USA 89:10892- 
10895; Hwu et al. (1993)7. Immunol 150:4104-4115; U.S. Patent No. 4,868,116; U.S. 
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Patent No. 4,980,286; PCT Application WO 89/07136; PCT Application WO 89/02468; 
PCT Application WO 89/05345; and PCT Application WO 92/07573). 

Another viral gene delivery system useful in the present invention utilizes 
adenovirus-derived vectors. The genome of an adenovirus can be manipulated such that it 
5 encodes and expresses a gene product of interest but is inactivated in terms of its ability to 
replicate in a normal lytic viral life cycle. See, for example, Berkner et al. (1988) 

BioTechniques 6:616; Rosenfeld et al. (1991) Science 252:431-434; and Rosenfeld et al. 
(1992) Cell 68:143-155. Suitable adenoviral vectors derived from the adenovirus strain Ad 
type 5 dl324 or other strains of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are known to those 

10 skilled in the art. Recombinant adenoviruses can be advantageous in certain circumstances 
in that they are not capable of infecting nondividing cells and can be used to infect a wide 
variety of cell types, including epithelial cells (Rosenfeld et al. (1992) cited supra). 
Furthermore, the virus particle is relatively stable and amenable to purification and 
concentration, and as above, can be modified so as to affect the spectrum of infectivity. 

15 Additionally, introduced adenoviral DNA (and foreign DNA contained therein) is not 
integrated into the genome of a host cell but remains episomal, thereby avoiding potential 
problems that can occur as a result of insertional mutagenesis in situations where introduced 
DNA becomes integrated into the host genome (e.g., retroviral DNA). Moreover, the 
carrying capacity of the adenoviral genome for foreign DNA is large (up to 8 kilobases) 

20 relative to other gene delivery vectors (Berkner et al. cited supra; Haj-Ahmand and Graham 
(1986) J. Virol 57:267). 

Yet another viral vector system useful for delivery of the subject Helios gene is the 
adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring defective 
virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus 

25 for efficient replication and a productive life cycle. (For a review see Muzyczka et al. Curr. 
Topics in Micro, and Immunol (1992) 158:97-129). It is also one of the few viruses that 
may integrate its DNA into non-dividing cells, and exhibits a high frequency of stable 
integration (see for example Flotte et al. (1992) Am. J. Respir. Cell Mol. Biol 7:349-356; 
Samulski et al. (1989) J. Virol 63:3822-3828; and McLaughlin et al. (1989) J. Virol 

30 62:1963-1973). Vectors containing as little as 300 base pairs of AAV can be packaged and 
can integrate. Space for exogenous DNA is limited to about 4.5 kb. An AAV vector such 
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as that described in Tratschin et al. (1985) Mol Cell. Biol. 5:3251-3260 can be used to 
introduce DNA into cells. A variety of nucleic acids have been introduced into different 
cell types using AAV vectors (see for example Hermonat et al. (1984) Proa Natl Acad Sci 
USA 81:6466-6470; Tratschin et al (1985) Mol Cell Biol 4:2072-2081; Wondisford et al. 
(1988) Mol Endocrinol 2:32-39; Tratschin et al. (1984)J. Virol 51:611-619; and Flotte et 
al. (1993) J. Biol Chem. 268:3781-3790). 

In addition to viral transfer methods, such as those illustrated above, non- viral 
methods can also be employed to cause expression of an Helios polypeptide in the tissue of 
a mammal, such as a human. Most nonviral methods of gene transfer rely on normal 
mechanisms used by mammalian cells for the uptake and intracellular transport of 
macromolecules. In preferred embodiments, non-viral gene delivery systems of the present 
invention rely on endocytic pathways for the uptake of the subject Helios gene by the 
targeted cell. Exemplary gene delivery systems of this type include liposomal derived 
systems, poly-lysine conjugates, and artificial viral envelopes. 

In a representative embodiment, a gene encoding an Helios polypeptide can be 
entrapped in liposomes bearing positive charges on their surface (e.g., lipofectins) and 
(optionally) which are tagged with antibodies against cell surface antigens of the target 
tissue (Mizuno et al. (1992) No Shinkei Geka 20:547-551; PCT publication WO91/06309; 

Japanese patent application 1047381; and European patent publication EP-A-43075). 

In clinical settings, the gene delivery systems for the therapeutic Helios gene can be 
introduced into a patient by any of a number of methods, each of which is familiar in the art. 
For instance, a pharmaceutical preparation of the gene delivery system can be introduced 
systemically, e.g. by intravenous injection, and specific transduction of the protein in the 
target cells occurs predominantly from specificity of transfection provided by the gene 
delivery vehicle, cell-type or tissue-type expression due to the transcriptional regulatory 
sequences controlling expression of the receptor gene, or a combination thereof. In other 
embodiments, initial delivery of the recombinant gene is more limited with introduction into 
the animal being quite localized. For example, the gene delivery vehicle can be introduced 
by catheter (see U.S. Patent 5,328,470) or by stereotactic injection (e.g. Chen et al. (1994) 
PNAS 91 : 3054-3057). In a preferred embodiment of the invention, the Helios gene is 
targeted to hematopoietic cells. 
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The pharmaceutical preparation of the gene therapy construct can consist essentially 
of the gene delivery system in an acceptable diluent, or can comprise a slow release matrix 
in which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery system can be produced in tact from recombinant cells, e.g. retroviral vectors, the 
pharmaceutical preparation can comprise one or more cells which produce the gene delivery 
system. 

Antisense Therapy 

Another aspect of the invention relates to the use of the isolated nucleic acid in 
"antisense" therapy. As used herein, "antisense" therapy refers to administration or in situ 
generation of oligonucleotides or their derivatives which specifically hybridize (e.g. bind) 
under cellular conditions, with the cellular mRNA and/or genomic DNA encoding an Helios 
polypeptide, or mutant thereof, so as to inhibit expression of the encoded protein, e.g. by 
inhibiting transcription and/or translation. The binding may be by conventional base pair 
complementarity, or, for example, in the case of binding to DNA duplexes, through specific 
interactions in the major groove of the double helix. In general, "antisense" therapy refers 
to the range of techniques generally employed in the art, and includes any therapy which 
relies on specific binding to oligonucleotide sequences. 

In one embodiment, the antisense construct binds to a naturally-occurring sequence 
of an Helios gene which, for example, is involved in expression of the gene. These 
sequences include, for example, start codons, stop codons, and RNA primer binding sites. 

In another embodiment, the antisense construct binds to a nucleotide sequence 
which is not present in the wild type gene. For example, the antisense construct can bind to 
a region of an Helios gene which contains an insertion of an exogenous, non-wild type 
sequence. Alternatively, the antisense construct can bind to a region of an Helios gene 
which has undergone a deletion, thereby bringing two regions of the gene together which 
are not normally positioned together and which, together, create a non-wild type sequence. 

When administered in vivo to a subject, antisense constructs which bind to non-wild 
type sequences provide the advantage of inhibiting the expression of mutant Helios gene, 
without inhibiting expression of any wild type Helios gene. 
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An antisense construct of the present invention can be delivered, for example, as an 
expression plasmid which, when transcribed in the cell, produces RNA which is 
complementary to at least a unique portion of the cellular mRNA which encodes a Helios 
polypeptide. Alternatively, the antisense construct is an oligonucleotide probe which is 

5 generated ex vivo and which, when introduced into the cell causes inhibition of expression 
by hybridizing with the mRNA and/or genomic sequences of an Helios gene. Such 
oligonucleotide probes are preferably modified oligonucleotide which are resistant to 
endogenous nucleases, e.g. exonucleases and/or endonucleases, and is therefore stable in 
vivo. Exemplary nucleic acid molecules for use as antisense oligonucleotides are 

10 phosphoramidate, phosphothioate and methylphosphonate analogs of DNA (see also U.S. 
Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, general approaches to 
constructing oligomers useful in antisense therapy have been reviewed, for example, by Van 
der Krol et al. (1988) Biotechniques 6:958-976; and Stein et al (1988) Cancer Res 48:2659- 
2668. 

15 Accordingly, the modified oligomers of the invention are useful in therapeutic, 

diagnostic, and research contexts. In therapeutic applications, the oligomers are utilized in a 
manner appropriate for antisense therapy in general. For such therapy, the oligomers of the 
invention can be formulated for a variety of loads of administration, including systemic and 
topical or localized administration. For systemic administration, injection is preferred, 

20 including intramuscular, intravenous, intraperitoneal, and subcutaneous for injection, the 
oligomers of the invention can be formulated in liquid solutions, preferably in 
physiologically compatible buffers such as Hank's solution or Ringer's solution. In 
addition, the oligomers may be formulated in solid form and redissolved or suspended 
immediately prior to use. Lyophilized forms are also included in the invention. 

25 The compounds can be administered orally, or by transmucosal or transdermal 

means. For transmucosal or transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are known in the art, 
and include, for example, for transmucosal administration bile salts and fusidic acid 
derivatives, and detergents. Transmucosal administration may be through nasal sprays or 

30 using suppositories. For oral administration, the oligomers are formulated into conventional 
oral administration forms such as capsules, tablets, and tonics. For topical administration, 
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the oligomers of the invention are formulated into ointments, salves, gels, or creams as 
known in the art. 

In addition to use in therapy, the oligomers of the invention may be used as 
diagnostic reagents to detect the presence or absence of the target DNA or RNA sequences 
to which they specifically bind. 

The antisense constructs of the present invention, by antagonizing the expression of 
an Helios gene, can be used in the manipulation of tissue, both in vivo and in ex vivo tissue 
cultures. 

Transgenic Animals 

The invention includes transgenic animals which include cells (of that animal) which 
contain an Helios transgene and which preferably (though optionally) express (or 
misexpress) an endogenous or exogenous Helios gene in one or more cells in the animal. 

The Helios transgene can encode a mutant Helios polypeptide. Such animals can be 
used as disease models or can be used to screen for agents effective at correcting the 
misexpression of Helios. Alternatively, the Helios transgene can encode the wild-type 
forms of the protein, or can encode homologs thereof, including both agonists and 
antagonists, as well as antisense constructs. In preferred embodiments, the expression of 
the transgene is restricted to specific subsets of cells, or tissues utilizing, for example, cis- 
acting sequences that control expression in the desired pattern. Tissue-specific regulatory 
sequences and conditional regulatory sequences can be used to control expression of the 
transgene in certain spatial patterns. Temporal patterns of expression can be provided by, 
for example, conditional recombination systems or prokaryotic transcriptional regulatory 
sequences. In preferred embodiments, the transgenic animal carries a "knockout" Helios 
gene, i.e., a deletion of all or a part of the Helios gene. 

Genetic techniques which allow for the expression of transgenes, that are regulated 
in vivo via site-specific genetic manipulation, are known to those skilled in the art. For 
example, genetic systems are available which allow for the regulated expression of a 
recombinase that catalyzes the genetic recombination a target sequence. As used herein, the 
phrase "target sequence" refers to a nucleotide sequence that is genetically recombined by a 
recombinase. The target sequence is flanked by recombinase recognition sequences and is 
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generally either excised or inverted in cells expressing recombinase activity. Recombinase 
catalyzed recombination events can be designed such that recombination of the target 
sequence results in either the activation or repression of expression of the subject Helios 
gene. For example, excision of a target sequence which interferes with the expression of a 

recombinant Helios gene, such as one which encodes an agonistic homolog, can be designed 

to activate expression of that gene. This interference with expression of the protein can 
result from a variety of mechanisms, such as spatial separation of the Helios gene from the 
promoter element or an internal stop codon. 

Moreover, the transgene can be made so that the coding sequence of the gene is 
flanked with recombinase recognition sequences and is initially transfected into cells in a 3 1 
to 5 f orientation with respect to the promoter element. In such an instance, inversion of the 
target sequence will reorient the subject gene by placing the 5 1 end of the coding sequence 
in an orientation with respect to the promoter element which allow for promoter driven 
transcriptional activation. See e.g., descriptions of the creiloxP recombinase system of 
bacteriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS 
89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et 
al. (1991) Science 251:1351-1355; PCT publication WO 92/15694). Genetic recombination 
of the target sequence is dependent on expression of the Cre recombinase. Expression of 
the recombinase can be regulated by promoter elements which are subject to regulatory 
control, e.g., tissue-specific, developmental stage-specific, inducible or repressible by 
externally added agents. This regulated control will result in genetic recombination of the 
target sequence only in cells where recombinase expression is mediated by the promoter 
element. Thus, the activation expression of the recombinant Helios gene can be regulated 
via control of recombinase expression. 

Similar conditional transgenes can be provided using prokaryotic promoter 
sequences which require prokaryotic proteins to be simultaneous expressed in order to 
facilitate expression of the transgene. Exemplary promoters and the corresponding trans- 
activating prokaryotic proteins are given in U.S. Patent No. 4,833,080. Moreover, 
expression of the conditional transgenes can be induced by gene therapy-like methods 
wherein a gene encoding the trans-activating protein, e.g. a recombinase or a prokaryotic 
protein, is delivered to the tissue and caused to be expressed, such as in a cell-type specific 
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manner. By this method, the Helios transgene could remain silent into adulthood until 
"turned on" by the introduction of the trans-activator. 

Production of Fragments and Analogs 

The inventor has provided the primary amino acid structure of an Helios 
polypeptide. Once an example of this core structure has been provided, one skilled in the 
art can alter the disclosed structure by producing fragments or analogs, and testing the 
newly produced structures for activity. Examples of prior art methods which allow the 
production and testing of fragments and analogs are discussed below. These, or analogous 
methods can be used to make and screen fragments and analogs of an Helios polypeptide 
having at least one biological activity e.g., which react with an antibody (e.g., a monoclonal 
antibody) specific for an Helios polypeptide. 

Generation of Fragments 

Fragments of a protein can be produced in several ways, e.g., recombinantly, by 
proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a 
polypeptide can be generated by removing one or more nucleotides from one end (for a 
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes 
the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments. 

Digestion with "end-nibbling" endonucleases can thus generate DNA's which encode an 

array of fragments. DNA's which encode fragments of a protein can also be generated by 
random shearing, restriction digestion or a combination of the above-discussed methods. 

Fragments can also be chemically synthesized using techniques known in the art 
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, 
peptides of the present invention may be arbitrarily divided into fragments of desired length 
with no overlap of the fragments, or divided into overlapping fragments of a desired length. 

Production of Altered DNA and Peptide Sequences: Random Methods 
Amino acid sequence variants of a protein can be prepared by random mutagenesis 
of DNA which encodes a protein or a particular domain or region of a protein. Useful 
methods include PCR mutagenesis and saturation mutagenesis. A library of random amino 
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acid sequence variants can also be generated by the synthesis of a set of degenerate 
oligonucleotide sequences. (Methods for screening proteins in a library of variants are 
elsewhere herein.) 

PCR Mutagenesis 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce random 
mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). This is 
a very powerful and relatively rapid method of introducing random mutations. The DNA 
region to be mutagenized is amplified using the polymerase chain reaction (PCR) under 
conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerase, e.g., by 

using a dGTP/dATP ratio of five and adding Mn 2_h to the PCR reaction. The pool of 
amplified DNA fragments are inserted into appropriate cloning vectors to provide random 
mutant libraries. 

Saturation Mutagenesis 

Saturation mutagenesis allows for the rapid introduction of a large number of single 
base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 229:242). This 
technique includes generation of mutations, e.g., by chemical treatment or irradiation of 

single-stranded DNA in vitro, and synthesis of a complementary DNA strand. The mutation 

frequency can be modulated by modulating the severity of the treatment, and essentially all 
possible base substitutions can be obtained. Because this procedure does not involve a 
genetic selection for mutant fragments both neutral substitutions, as well as those that alter 
function, are obtained. The distribution of point mutations is not biased toward conserved 
sequence elements. 

Degenerate Oligonucleotides 

A library of homologs can also be generated from a set of degenerate 
oligonucleotide sequences. Chemical synthesis of a degenerate sequences can be carried 
out in an automatic DNA synthesizer, and the synthetic genes then ligated into an 
appropriate expression vector. The synthesis of degenerate oligonucleotides is known in the 
art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) 
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Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, 
Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura 
etal. (19S4) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477. Such techniques 
have been employed in the directed evolution of other proteins (see, for example, Scott et al. 
(1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) 
Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents 
Nos. 5,223,409, 5,198,346, and 5,096,815). 

Production of Altered DNA and Peptide Sequences: Methods for Directed 
Mutagenesis 

Non-random or directed, mutagenesis techniques can be used to provide specific 
sequences or mutations in specific regions. These techniques can be used to create variants 
which include, e.g., deletions, insertions, or substitutions, of residues of the known amino 
acid sequence of a protein. The sites for mutation can be modified individually or in series, 
e.g., by (1) substituting first with conserved amino acids and then with more radical choices 
depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of 
the same or a different class adjacent to the located site, or combinations of options 1-3. 

Alanine Scanning Mutagenesis 

Alanine scanning mutagenesis is a useful method for identification of certain 

residues or regions of the desired protein that are preferred locations or domains for 
mutagenesis, Cunningham and Wells {Science 244:1081-1085, 1989). In alanine scanning, 
a residue or group of target residues are identified (e.g., charged residues such as Arg, Asp, 
His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most 
preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction 
of the amino acids with the surrounding aqueous environment in or outside the cell. Those 
domains demonstrating functional sensitivity to the substitutions are then refined by 
introducing further or other variants at or for the sites of substitution. Thus, while the site 
for introducing an amino acid sequence variation is predetermined, the nature of the 
mutation per se need not be predetermined. For example, to optimize the performance of a 
mutation at a given site, alanine scanning or random mutagenesis may be conducted at the 
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target codon or region and the expressed desired protein subunit variants are screened for 
the optimal combination of desired activity. 

Oli gonucleotide-Mediated Mutagenesis 

Oligonucleotide-mediated mutagenesis is a useful method for preparing substitution, 

deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 2:183, 1983). 
Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a mutation 
to a DNA template, where the template is the single-stranded form of a plasmid or 
bacteriophage containing the unaltered or native DNA sequence of the desired protein. 
After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the oligonucleotide primer, 
and will code for the selected alteration in the desired protein DNA. Generally, 

oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide 
will have 12 to 15 nucleotides that are completely complementary to the template on either 
side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will 
hybridize properly to the single-stranded DNA template molecule. The oligonucleotides are 
readily synthesized using techniques known in the art such as that described by Crea et aL 
(Proc. Natl Acad. Set USA, 75: 5765[1978]). 

Cassette Mutagenesis 

Another method for preparing variants, cassette mutagenesis, is based on the 
technique described by Wells et al. (Gene, 34:315 [1985]). The starting material is a 
plasmid (or other vector) which includes the protein subunit DNA to be mutated. The 
codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique 
restriction endonuclease site on each side of the identified mutation site(s). If no such 
restriction sites exist, they may be generated using the above-described oligonucleotide- 
mediated mutagenesis method to introduce them at appropriate locations in the desired 
protein subunit DNA. After the restriction sites have been introduced into the plasmid, the 
plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the 
sequence of the DNA between the restriction sites but containing the desired mutation(s) is 
synthesized using standard procedures. The two strands are synthesized separately and then 
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hybridized together using standard techniques. This double-stranded oligonucleotide is 
referred to as the cassette. This cassette is designed to have 3* and 5 ! ends that are 
comparable with the ends of the linearized plasmid, such that it can be directly ligated to the 
plasmid. This plasmid now contains the mutated desired protein subunit DNA sequence. 

Combinatorial Mutagenesis 

Combinatorial mutagenesis can also be used to generate mutants, e.g., a library of 
variants which is generated by combinatorial mutagenesis at the nucleic acid level, and is 
encoded by a variegated gene library. For example, a mixture of synthetic oligonucleotides 
can be enzymatically ligated into gene sequences such that the degenerate set of potential 
sequences are expressible as individual peptides, or alternatively, as a set of larger fusion 
proteins containing the set of degenerate sequences. 

Primary High-Through-Put Methods for Screening Libraries of Peptide Fragments 
or Homologs 

Various techniques are known in the art for screening generated mutant gene 
products. Techniques for screening large gene libraries often include cloning the gene 
library into replicabie expression vectors, transforming appropriate cells with the resulting 
library of vectors, and expressing the genes under conditions in which detection of a desired 
activity, e.g., in this case, binding to an antibody specific for a Helios polypeptide. Each of 
the techniques described below is amenable to high through-put analysis for screening large 
numbers of sequences created, e.g., by random mutagenesis techniques. 

Display Libraries 

In one approach to screening assays, the candidate peptides are displayed on the 
surface of a cell or viral particle, and the ability of particular cells or viral particles to bind 
an appropriate receptor protein via the displayed product is detected in a "panning assay". 
For example, the gene library can be cloned into the gene for a surface membrane protein of 
a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 
88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 
18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for 
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potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can 
be used to detect homolog which retain ligand-binding activity. The use of fluorescently 
labeled ligands, allows cells to be visually inspected and separated under a fluorescence 
microscope, or, where the morphology of the cell permits, to be separated by a 
5 fluorescence-activated cell sorter. 

A gene library can be expressed as a fusion protein on the surface of a viral particle. 
For instance, in the filamentous phage system, foreign peptide sequences can be expressed 
on the surface of infectious phage, thereby conferring two significant benefits. First, since 

these phage can be applied to affinity matrices at concentrations well over 10^3 phage per 
10 milliliter, a large number of phage can be screened at one time. Second, since each 

infectious phage displays a gene product on its surface, if a particular phage is recovered 
from an affinity matrix in low yield, the phage can be amplified by another round of 

2 infection. The group of almost identical E. coli filamentous phages M13, fd., and fl are 

■ n- i. - i 

l most often used in phage display libraries. Either of the phage gill or gVIII coat proteins 

15 can be used to generate fusion proteins without disrupting the ultimate packaging of the 

m viral particle. Foreign epitopes can be expressed at the NH2-terminal end of pill and phage 

J" bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner 

g et al. PCT publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks 

fy et al. (1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J 12:725-734; 

% 20 Clackson et al (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461). 

\§ A common approach uses the maltose receptor of E. coli (the outer membrane 

protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037). 
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce 
peptides fused into one of the extracellular loops of the protein. These peptides are 
25 available for binding to ligands, e.g., to antibodies, and can elicit an immune response when 
the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al. 
(1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 37-45), and PAL 
(Fuchs et al (1991) Bio/Tech 9, 1369-1372), as well as large bacterial surface structures 
have served as vehicles for peptide display. Peptides can be fused to pilin, a protein which 
30 polymerizes to form the pilus-a conduit for interbacterial exchange of genetic information 
(Thiry et al. (1989) Appl Environ. Microbiol 55, 984-993). Because of its role in 
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interacting with other cells, the pilus provides a useful support for the presentation of 
peptides to the extracellular environment. Another large surface structure used for peptide 
display is the bacterial motive organ, the flagellum. Fusion of peptides to the subunit 
protein flagellin offers a dense array of may peptides copies on the host cells (Kuwajima et 
al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of other bacterial species have also 
served as peptide fusion partners. Examples include the Staphylococcus protein A and the 
outer membrane protease IgA of Neisseria (Hansson et al. (1992) J. Bacteriol 174, 4239- 
4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999). 

In the filamentous phage systems and the LamB system described above, the 
physical link between the peptide and its encoding DNA occurs by the containment of the 
DNA within a particle (cell or phage) that carries the peptide on its surface. Capturing the 
peptide captures the particle and the DNA within. An alternative scheme uses the DNA- 
binding protein Lad to form a link between peptide and DNA (Cull et al (1992) PNAS USA 
89:1865-1869). This system uses a plasmid containing the Lad gene with an 
oligonucleotide cloning site at its 3'-end. Under the controlled induction by arabinose, a 
Lacl-peptide fusion protein is produced. This fusion retains the natural ability of Lad to 
bind to a short DNA sequence known as LacO operator (LacO). By installing two copies of 
LacO on the expression plasmid, the Lacl-peptide fusion binds tightly to the plasmid that 
encoded it. Because the plasmids in each cell contain only a single oligonucleotide 
sequence and each cell expresses only a single peptide sequence, the peptides become 
specifically and stably associated with the DNA sequence that directed its synthesis. The 
cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix 
of immobilized receptor to recover the complexes containing active peptides. The 
associated plasmid DNA is then reintroduced into cells for amplification and DNA 
sequencing to determine the identity of the peptide ligands. As a demonstration of the 
practical utility of the method, a large random library of dodecapeptides was made and 
selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort 
of peptides was recovered, all related by a consensus sequence corresponding to a six- 
residue portion of dynorphin B. (Cull et al. (1992) Proc. Natl Acad. Sci. U.S.A. 89-1869) 

This scheme, sometimes referred to as peptides-on-plasmids, differs in two 
important ways from the phage display methods. First, the peptides are attached to the C- 
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terminus of the fusion protein, resulting in the display of the library members as peptides 
having free carboxy termini. Both of the filamentous phage coat proteins, pill and pVIII, 
are anchored to the phage through their C -termini, and the guest peptides are placed into the 
outward-extending N-terminal domains. In some designs, the phage-displayed peptides are 
presented right at the amino terminus of the fusion protein. (Cwirla, et al. (1990) Proc. Natl 
Acad. Set U.S.A. 87, 6378-6382) A second difference is the set of biological biases 
affecting the population of peptides actually present in the libraries. The LacI fusion 
molecules are confined to the cytoplasm of the host cells. The phage coat fusions are 
exposed briefly to the cytoplasm during translation but are rapidly secreted through the 
inner membrane into the periplasmic compartment, remaining anchored in the membrane by 
their C-terminal hydrophobic domains, with the N-termini, containing the peptides, 
protruding into the periplasm while awaiting assembly into phage particles. The peptides in 
the LacI and phage libraries may differ significantly as a result of their exposure to different 
proteolytic activities. The phage coat proteins require transport across the inner membrane 
and signal peptidase processing as a prelude to incorporation into phage. Certain peptides 

exert a deleterious effect on these processes and are underrepresented in the libraries 
(Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251). These particular biases are not a 
factor in the LacI display system. 

The number of small peptides available in recombinant random libraries is 

enormous. Libraries of 10^-1 0^ independent clones are routinely prepared. Libraries as 

large as 10^ recombinants have been created, but this size approaches the practical limit 
for clone libraries. This limitation in library size occurs at the step of transforming the 
DNA containing randomized segments into the host bacterial cells. To circumvent this 
limitation, an in vitro system based on the display of nascent peptides in polysome 
complexes has recently been developed. This display library method has the potential of 
producing libraries 3-6 orders of magnitude larger than the currently available 
phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries, 
expression of the peptides, and screening, is done in an entirely cell-free format. 

In one application of this method (Gallop et al (1994) J. Med. Chem. 37(9):1233- 

1251), a molecular DNA library encoding 10^2 decapeptides was constructed and the 
library expressed in an is. coli S30 in vitro coupled transcription/translation system. 
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Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a 
substantial proportion of the RNA in polysomes and yielding complexes containing nascent 
peptides still linked to their encoding RNA. The polysomes are sufficiently robust to be 
affinity purified on immobilized receptors in much the same way as the more conventional 
recombinant peptide display libraries are screened. RNA from the bound complexes is 
recovered, converted to cDNA, and amplified by PCR to produce a template for the next 
round of synthesis and screening. The polysome display method can be coupled to the 
phage display system. Following several rounds of screening, cDNA from the enriched 
pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide 
expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing 
vector for peptide identification. By expressing the polysome-derived peptides on phage, 
one can either continue the affinity selection procedure in this format or assay the peptides 
on individual clones for binding activity in a phage ELISA, or for binding specificity in a 
completion phage ELISA (Barret, et aL (1992) Anal Biochem 204,357-364). To identify 
the sequences of the active peptides one sequences the DNA produced by the phagemid 
host. 

Secondary Screens 

The high through-put assays described above can be followed by secondary screens 
in order to identify further biological activities which will, e.g., allow one skilled in the art 
to differentiate agonists from antagonists. The type of a secondary screen used will depend 

on the desired activity that needs to be tested. For example, an assay can be developed in 
which the ability to inhibit an interaction between a protein of interest and its respective 
ligand can be used to identify antagonists from a group of peptide fragments isolated though 
one of the primary screens described above. 

Therefore, methods for generating fragments and analogs and testing them for 
activity are known in the art. Once the core sequence of a protein of interest is identified, 
such as the primary amino acid sequence of Helios polypeptide as disclosed herein, it is 
routine to perform for one skilled in the art to obtain analogs and fragments. 



Peptide Analogs of Helios 
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Peptide analogs of an Helios polypeptide are preferably less than 400, 300, 200, 150, 
130, 110, 90, 70 amino acids in length, preferably less than 50 amino acids in length, most 
preferably less than 30, 20 or 10 amino acids in length. In preferred embodiments, the 
peptide analogs of an Helios polypeptide are at least about 10, 20, 30, 50, 100 or 130 amino 
acids in length. 

Peptide analogs of an Helios polypeptide have preferably at least about 60%, 70%, 
74%, 80%, 85%, 90%, 95% or 99% homology or sequence similarity with the naturally 
occurring Helios polypeptide. 

Peptide analogs of an Helios polypeptide differ from the naturally occurring Helios 
polypeptide by at least 1, 2, 5, 10 or 20 amino acid residues; preferably, however, they 
differ in less than 15, 10 or 5 amino acid residues from the naturally occurring Helios 
polypeptide. 

Useful analogs of an Helios polypeptide can be agonists or antagonists. Antagonists 
of an Helios polypeptide can be molecules which form the Helios-Ikaros dimers but which 
lack some additional biological activity such as transpriptional activation of genes that 
control lymphocyte development. Helios antagonists and agonists are derivatives which can 
modulate, e.g., inhibit or promote, lymphocyte maturation and function. 

A number of important functional Helios domains have been identified by the 
inventors. This body of knowledge provides guidance for one skilled in the art to make 
Helios analogs. One would expect nonconservative amino acid changes made in a domain 
to disrupt activities in which that domain is involved. Conservative amino acid changes, 
especially those outside the important functional domains, are less likely to modulate a 
change in activity. A discussion of conservative amino acid substitutions is provided 
herein. 

The general structure of Helios and Ikaros proteins is very similar, and four blocks 
of sequence are particularly well conserved. The first block of conservation encodes the 
zinc finger modules contained in the Ik-1 isoform which mediate DNA binding of the Ikaros 
protein (Molnar et al. (1994) Mol Cell Biol 14 8292-8303). The second block of 
conservation has not been characterized functionally. 

The third block of conservation a highly conserved 81 amino acid sequence which 
has been shown to mediate transcriptional activity of the Ikaros proteins. This activation 
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domain of Ikaros is composed of a stretch of acidic amino acids followed by a stretch of 
hydrophobic residues, both of which are required for its full activation potential. This 
domain from Ikaros alone or the full length Ikaros protein confers transcriptional activity of 
a fusion protein with the LexA DNA binding domain. This example shows that the 
homologous domain in Helios is also a transcriptional activation domain in yeast and 
mammalian cells and that the Helios transcriptional activation domain provides stronger 
transcriptional activity than the homologous domain from Ikaros in mammalian cells. The 
results show that the 232 C-terminal amino acids of Helios is capable of conferring 
transcriptional activation in yeast cells. No activity was detected with the 149 most C- 
terminal amino acids of Helios, which do not contain the conserved domain. 

The fourth block of conservation corresponds to the zinc fingers which mediate 
dimerization. A C-terminal 149 amino acids of Helios which contain the two terminal zinc 
finger domains mediate protein dimerization. 

Antibodies 

The invention also includes antibodies specifically reactive with a subject Helios 
polypeptide or Helios-Ikarod dimers. Anti-protein/anti-peptide antisera or monoclonal 
antibodies can be made by standard protocols (See, for example, Antibodies: A Laboratory 
Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a 
mouse, a hamster or rabbit can be immunized with an immunogenic form of the peptide. 
Techniques for conferring immunogenicity on a protein or peptide include conjugation to 
carriers or other techniques well known in the art. An immunogenic portion of the subject 
Helios polypeptide can be administered in the presence of adjuvant. The progress of 
immunization can be monitored by detection of antibody titers in plasma or serum. 
Standard ELISA or other immunoassays can be used with the immunogen as antigen to 
assess the levels of antibodies. In a preferred embodiment, the subject antibodies are 
immunospecific for antigenic determinants of the Helios-Iakros dimers or Helios 
polypeptide of the invention, e.g. antigenic determinants of a polypeptide of SEQ ID NO:2, 
SEQ ID NO:4, or SEQ ID NO:6. 

The term "antibody", as used herein, intended to include fragments thereof which 
are also specifically reactive with an Helios polypeptide or Helios-Ikaros dimers. 
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Antibodies can be fragmented using conventional techniques and the fragments screened for 
utility in the same manner as described above for whole antibodies. For example, F(ab')2 
fragments can be generated by treating antibody with pepsin. The resulting F(ab')2 

fragment can be treated to reduce disulfide bridges to produce Fab' fragments. 

Both monoclonal and polyclonal antibodies (Ab) directed against Helios-Ikaros 
dimers or Helios polypeptides, or fragments or analogs thereof, and antibody fragments 
such as Fab' and F(ab')2 ? can be used to block the action of an Helios and/or Ikaros 

polypeptide and allow the study of the role of an Helios polypeptide of the present 
invention. 

Antibodies which specifically bind Helios-Ikaros dimers or Helios polypeptide 
epitopes can also be used in immunohistochemical staining of tissue samples in order to 
evaluate the abundance and pattern of expression of Helios-Ikaros dimer or Helios 
polypeptide. Anti-Helios polypeptide antibodies can be used diagnostically in immuno- 
precipitation and immuno-blotting to detect and evaluate wild type or mutant Helios 

polypeptide levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, 

the ability to monitor Helios-Ikaros dimer or Helios polypeptide levels in an individual can 
allow determination of the efficacy of a given treatment regimen for an individual afflicted 
with disorders associated with modulation of lymphocyte differentiation and/or 
proliferation. The level of an Helios-Ikaros dimer or Helios polypeptide can be measured in 

tissue, such as produced by biopsy. 

Another application of anti-Helios antibodies of the present invention is in the 
immunological screening of cDNA libraries constructed in expression vectors such as 

gtll, gtl8-23, ZAP, and O RF 8 . Messenger libraries o f this type, having coding 
sequences inserted in the correct reading frame and orientation, can produce fusion proteins. 
For instance, gtll will produce fusion proteins whose amino termini consist of 6- 
galactosidase amino acid sequences and whose carboxy termini consist of a foreign 
polypeptide. Antigenic epitopes of a subject Helios polypeptide can then be detected with 
antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates with 
anti-Helios polypeptide antibodies. Phage, scored by this assay, can then be isolated from 
the infected plate. Thus, the presence of Helios homologs can be detected and cloned from 
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other animals, and alternate isoforms (including splicing variants) can be detected and 
cloned from human sources. 

Drug Screening- Assays 

By making available purified and recombinant-Helios polypeptides, the present 
invention provides assays which can be used to screen for drugs which are either agonists or 
antagonists of the normal cellular function, in this case, of the subject Helios polypeptide. 
In one embodiment, the assay evaluates the ability of a compound to modulate binding 
between an Helios polypeptide and a naturally occurring ligand, e.g., an antibody specific 
for a Helios polypeptide or an Ikaros polypeptide. A variety of assay formats will suffice 
and, in light of the present inventions, will be comprehended by skilled artisan. 

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free 
systems, such as may be derived with purified or semi-purified proteins, are often preferred 
as "primary" screens in that they can be generated to permit rapid development and 
relatively easy detection of an alteration in a molecular target which is mediated by a test 
compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test 
compound can be generally ignored in the in vitro system, the assay instead being focused 
primarily on the effect of the drug on the molecular target as may be manifest in an 
alteration of binding affinity with other proteins or change in enzymatic properties of the 
molecular target. 

Other Embodiments 

Included in the invention are: allelic variations; natural mutants; induced mutants; 
proteins encoded by DNA that hybridizes under high or low stringency conditions to a 
nucleic acids which encode polypeptides of SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6 
(for definitions of high and low stringency see Current Protocols in Molecular Biology, 
John Wiley & Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by reference); and, 
polypeptides specifically bound by antisera to an Helios polypeptide. 
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Nucleic acids and polypeptides of the invention includes those that differ from the 
sequences discolosed herein by virtue of sequencing errors in the disclosed sequences. 

Also included in the invention is a composition which includes an Helios 
polypeptide, e.g., an Helios/Helios dimer or an Helios/Ikaros peptide, and one or more 

5 additional components, e.g., a carrier, diluent, or solvent. The additional component can be 
one which renders the composition useful for in vitro, in vivo, pharmaceutical, or veterinary 
use. Examples of in vitro use are binding studies. Examples of in vivo use are the induction 
of antibodies. 

The invention also includes fragments, preferably biologically active fragments, or 
10 analogs of an Helios polypeptide. A biologically active fragment or analog is one having 
any in vivo or in vitro activity which is characteristic of the Helios polypeptide shown in 
SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, or of other naturally occurring Helios 
polypeptides, e.g., one or more of the biological activities described above. Especially 
preferred are fragments which exist in vivo, e.g., fragments which arise from post 

1 5 transcriptional processing or which arise from translation of alternatively spliced RNA's. 

Fragments include those expressed in native or endogenous cells, e.g., as a result of post- 
radiational processing, e.g., as the result of the removal of an amino-terminal signal 
sequence, as well as those made in expression systems, e.g., in CHO cells. Because 
peptides, such as an Helios polypeptide, often exhibit a range of physiological properties 

20 and because such properties may be attributable to different portions of the molecule, a 
useful Helios polypeptide fragment or Helios polypeptide analog is one which exhibits a 
biological activity in any biological assay for Helios polypeptide activity. Most preferably 
the fragment or analog possesses 10%, preferably 40%, or at least 90% of the activity of an 
Helios polypeptide (SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6), in any in vivo or in 

25 vitro Helios polypeptide activity assay. 

Analogs can differ from a naturally occurring Helios polypeptide in amino acid 
sequence or in ways that do not involve sequence, or both. Non-sequence modifications 
include in vivo or in vitro chemical derivatization of an Helios polypeptide. Non-sequence 
modifications include changes in acetylation, methylation, phosphorylation, carboxylation, 

30 or glycosylation. 
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Preferred analogs include an Helios polypeptide (or biologically active fragments 
thereof) whose sequences differ from the wild-type sequence by one or more conservative 
amino acid substitutions or by one or more non-conservative amino acid substitutions, 

deletions, or insertions which do not abolish the Helios polypeptide biological activity. 
Conservative substitutions typically include the substitution of one amino acid for another 
with similar characteristics, e.g., substitutions within the following groups: valine, glycine; 
glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, 
glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Other 
conservative substitutions can be taken from the table below. 



TABLE 1 

CONSERVATIVE AMINO ACID REPLACEMENTS 



For Amino 
Acid 


Code 


Replace with any of 


Alanine 


A 


D-Ala, Gly, beta-Ala, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys, homo-Arg, D- 
homo-Arg, Met, He, D-Met, D-Ile, 
Orn, D-Orn 


Asparagine 


N 


D-Asn, Asp, D-Asp, Glu, D-Glu, Gin, 
D-Gln 


Aspartic Acid 


D 


D-Asp, D-Asn, Asn, Glu, D-Glu, Gin, 
D-Gln 


Cysteine 


C 


D-Cys, S-Me-Cys, Met, D-Met, Thr, 
D-Thr 
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Glutamine 


Q 


D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, 
D-Asp 


Glutamic Acid 


E 


D-Glu, D-Asp, Asp, Asn, D-Asn, Gin, 
D-Gln 


Glycine 


G 


Ala, D-Ala, Pro, D-Pro, p-Ala 
Acp 


Isoleucine 


I 


D-Ile, Val, D-Val, Leu, D-Leu, Met, 
D-Met 


Leucine 


L 


D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met 


Lysine 


K 


D-Lys, Arg, D-Arg, homo-Arg, D- 
homo-Arg, Met, D-Met, He, D-Ile, 
Orn, D-Orn 


Methionine 


M 


D-Met, S-Me-Cys, He, D-Ile, Leu, 
D-Leu, Val, D-Val 


Phenylalanine 


F 


D-Phe, Tyr, D-Thr, L-Dopa, His, D- 
His, Trp, D-Trp, Trans-3,4, or 5- 
phenylproline, cis-3,4, 
or 5-phenylproline 


Proline 


P 


D-Pro, L-I-thioazolidine-4- 
carboxylic acid, D-or L-l- 
oxazolidine-4-carboxylic acid 
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Serine 


S 


D-Ser, Thr, D-Thr, allo-Thr, Met, 
D-Met, Met(O), D-Met(O), L-Cys, D- 
Cys 


Threonine 




D-Thr, Ser, D-Ser, allo-Thr, Met, 
D-Met, Met(O), D-Met(O), Val, D-Val 


Tyrosine 


Y 


D-Tyr, Phe, D-Phe, L-Dopa, His, D- 
His 


Valine 


V 


D-Val, Leu, D-Leu, He, D-Ile, Met, 
D-Met 



Other analogs within the invention are those with modifications which increase 
peptide stability; such analogs may contain, for example, one or more non-peptide bonds 

5 (which replace the peptide bonds) in the peptide sequence. Also included are: analogs that 
include residues other than naturally occurring L-amino acids, e.g., D-amino acids or non- 
naturally occurring or synthetic amino acids, e.g., p or y amino acids; and cyclic analogs. 

As used herein, the term "fragment", as applied to an Helios polypeptide analog, will 
ordinarily be at least about 20 residues, more typically at least about 40 residues, preferably 

10 at least about 60 residues in length. Fragments of an Helios polypeptide can be generated 
by methods known to those skilled in the art. The ability of a candidate fragment to exhibit 
a biological activity of an Helios polypeptide can be assessed by methods known to those 
skilled in the art, as described herein. Also included are Helios polypeptides containing 
residues that are not required for biological activity of the peptide or that result from 

15 alternative mRNA splicing or alternative protein processing events. 

In order to obtain an Helios polypeptide, an Helios polypeptide-encoding 
DNA can be introduced into an expression vector, the vector introduced into a cell suitable 
for expression of the desired protein, and the peptide recovered and purified, by prior art 
methods. Antibodies to the peptides an proteins can be made by immunizing an animal, 
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e.g., a rabbit or mouse, and recovering anti-Helios polypeptide antibodies by prior art 
methods. 

Examples: 

Identification of Helios, a novel Tkaros-related g ene 

In order, to identify a novel Ikaros-related factor a PCR-based approach was used. 
Degenerate primers, GEKPFK and YTIHMG, encoding conserved sequences in the Ikaros 
N-terminal (Ik-F) and C-terminal zinc finger (Ik-R) domains (Turpen et al., Immunity, 
7:325-334, 1997) were used to amplify cDNAs generated from the spleen of Aiolos mutant 
mice. A PCR product of the expected 980 base pair size was cloned and shown to have 
unique DNA sequence with homology to the Ikaros gene. Full length coding sequence was 
obtained by RACE PCR using nested specific internal primers. Nested gene specific primers 
were as follows: 5"-RACE R51: GGGTGAAGGCCTCAGGT (SEQ ID NO:9) and R52: 
CCATCATATGAGACTGCATCAGCTCCAGCCTCC (SEQ IDNO:10); 3' -RACER31: 
GGAGGCTGAGCTGATGCACTCTCATATGATGG (SEQ ID NO: 11) and R32: 
CACCTACCTTGGAGCTGAGGCCCTTCACCC (SEQ ID NO: 12). The RACE PCR was 
performed using the Marathon cDNA Amplification Kit (Clonetech, Palo Alto, CA) and 
TaKaRa LA Taq DNA polymerase (Takara Shuzo, Shiga, Japan). The amplification 

conditions were 1.5 min. at 95°C for 1 cycle, 20 seconds at 98°C and 2.5 minutes at 72°C 
for 5 cycles, 20 seconds at 98°C and 2.5 minutes at 70°C for 5 cycles, 30 seconds at 98°C 
and 2.5 minutes at 68°C for 32 cycles, and 1 cycle of 10 minutes at 72°C. A second round 
of amplification using nested primers was performed using a portion of the first product as 
template. The second amplification was L5 minutes at 95°C for 1 cycle, 20 seconds at 98°C 
and 2.5 minutes at 68°C for 20 cycles, followed by 72°C for 10 minutes for 1 cycle. 5 1 and 
3 5 products were cloned into the pGEM-T Easy vector (Promega, Madison, Wisconsin) and 
sequenced PCR analysis of Helios expression in hematopoietic cells using various 
combination of specific 5" and 3" primer pairs routinely yielded two bands. These two 
bands were cloned and sequenced to show that the two alternatively spliced transcripts 
differed in the presence of sequence encoding the first N-terminal zinc finger. 

The encoded protein, designated Helios, shows a high degree of conservation to 
Ikaros and Aiolos (73% and 67% similarity overall, respectively) (Figure 1). The three 
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proteins are nearly identical throughout the N-terminal zinc finger DNA-binding domain. 
There is a 93% identity between Helios and Ikaros from the first through the fourth zinc 
fingers and 88% identity between the same regions of Helios and Aiolos. The protein 
dimerization domain, comprising the C-terminal zinc fingers is 86% identical between 
Helios and Ikaros and 75% identical between Helios and Aiolos. In a third region, that 
contains the transcriptional activation domain, Helios shares 68% similarity to Ikaros and 
70% identity to Aiolos. As mentioned above, two alternatively spliced forms of Helios 
were identified by PCR from thymus cDNA. Sequence analysis of the two Helios isoforms 
revealed that they encode products that differ in the number of N-terminal zinc fingers. The 
full length isoform (Hel-1) is analogous to Ikaros isoform Ik-1 in that it contains all four 
DNA-binding zinc fingers. The second isoform (Hel-2) is similar to Ik-2 in that it is 
missing zinc finger 1, although the exon removed to generate Ik-2 includes additional 
sequence N-terminal to the zinc finger that is retained in the Hel-2 isoform. PCR analysis 
using various combinations of primer pairs revealed no other isoforms that migrate at 
approximately 64 and 66 kDa, as described below. No other proteins are detected by 
Western blot analysis of thymocyte nuclear extracts using an affinity purified polyclonal 
antibody against Helios. The strong conservation of the N-terminal zinc finger motifs of 
Hel-1 and Hel-2 with Ikaros isoforms Ik-1 and Ik-2 predicts that they will display similar 
affinities and DNA binding specificities. 

5' and 3' RACE strategies were used to clone the ends of the human Helios cDNA 
after cloning of the internal section of this message using degenerate primers from the 5' 
and 3' zinc finger regions. The 3' untranslated sequence extends for an additional 3 kb. The 
human clone encodes a protein which is identical to mouse Helios. The nucleotide and 
inferred protein sequences of mouse and human Helios were compared using the GCG 
Bestfit program. 

Expression of Helios during embryogenesis 

The expression of Helios during mouse embryogenesis was examined by in situ 
hybridization. Ikaros expression was analyzed in an adjacent section at each stage for 
comparison. In situ hybridization, including embryo preparation, probe synthesis and in 
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situ hybridization, was carried out essentially as described [Ikeda, Dev. Dynamics, 20:318- 
329, 1996]. Four micrometer sections were prepared from embryonic days 8, 11, 13 and 16 
and were hybridized with single stranded [33P]UTP labeled antisense RNA probes specific 
to each gene. Slides were exposed for 5 weeks, stained with hematoxylin and eosin and 
analyzed with both bright and dark field microscopy. 

Helios was found to be expressed in all hematopoietic centers of the developing 
embryo. The blood islands of the yolk sac constituted the first site of embryonic 
hematopoiesis. Helios and Ikaros were expressed in this extraembryonic site at day 8 of 
gestation. However, by day 1 1 , Helios expression was significantly decreased, while Ikaros 
expression was maintained through embryonic day 13 in this region. Both Helios and 
Ikaros were expressed in the liver at day 1 1 ; however, Helios mRNA was present in a subset 
of cells in this tissue. Throughout hematopoietic development, Helios expression in the 
liver was detected in a small number of scattered cells. In contrast, Ikaros was expressed at 
high levels in most of the cells present in this tissue during mid to late gestation. In the 
thymus, Helios was first detected at low levels at embryonic day 13, while Ikaros 
expression was readily detected in this site two days earlier. By day 16, Helios was 
expressed at high levels toward the center of the thymus, a region where early progenitors 
enter from the vasculature. In contrast, Ikaros was detected in most thymocytes. This 
pattern of Helios expression was maintained in the postnatal thymus. Helios was also 
detected in a small subset of cells within the spleen of the adult. Within the splenic 
germinal centers of an immunized animal, a small number of cells expressed moderate 
levels of Helios, while Ikaros was present at high levels throughout these centers. Their 
localization suggested that these may be CD4 + TH2 cells. 

Outside of the hematopoietic system, Helios expression was high in a number of 
epithelial tissues. These include the endoderm lining the gut, the tubules of the kidney, the 
lining of the respiratory tract and olfactory epithelium. During late gestation high levels of 
Helios expression were detected in the salivary glands and ducts. 

The expression of Helios in adult tissues was examined by Northern blot analysis of 
polyA+ selected RNAs using the region between the N- and C-terminal zinc fingers of 
Helios as a probe. Northern blot analysis and RT-PCR were carried out essentially as 
follows. A 980 bp cDNA was used as a probe for Northern analysis. This probe did not 
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cross react with Ikaros or Aiolos, which yield transcripts of distinct sizes. The blot had 
previously been screen with a GAPDH probe to confirm equivalent loading of RNA 
samples. Northern results showed that a transcript of approximately 8 kb was detected in 
thymus. At various times during embryogenesis, Helios was expressed in the lung, liver, 
kidney and brain; however, Helios mRNA was not detected by Northern analysis in these 
tissues in the adult. The Helios probe did not cross react with either Ikaros or Aiolos that 
encode more abundant messages of distinct sizes in the thymus and spleen. 

Expressiop of Helios in Hematopoietic Subpopulations 

The expression of Ikaros gene family members in sorted hematopoietic and 
lymphoid progenitors of the adult was examined by RT-PCR using specific primer pairs for 
Helios, Ikaros or Aiolos. RT-PCR conditions and Ikaros and Aiolos primers were carried 
out. HPRT primers (for.TGGCCCTCTGTGGTGCTCAAG (SEQ ID NO: 13); 
Rev:CACAGGACTAGAACACCTGC (SEQ ID NO; 14) were used as a control for RNA 
recovery. For analysis of Helios expression in hematopoietic cells, the following primer 
pairs were used: Forward (2F): GAACACGCCAATATGGCC (SEQ ID NO: 15) 
(nucleotides 60-78 of Helios cDNA) and Reverse (8R): GGCGTTGGTAGCATCCAAAGC 
(SEQ ID NO: 16) (nucleotides 1327-47 of Helios cDNA). For PCRs, primers 125F: 

AGAATGTCAGCATGGAGGCT (SEQ ID NO: 17) (nucleotides 707-726) and 8R were 
used for amplification. This forward primer is downstream of the region encoding the first 
zinc finger and therefore, only amplifies one Helios isoform. In all cases, the annealing 
temperature was 60°C and amplification was determined to be in the linear range. For bone 
marrow derived progenitor populations where cells were limiting in number, cDNA from 50 
cells equivalents was amplified for 32 cycles. For thymocyte precursors, amplification was 
done 1000 cell equivalents for 26 cycles for each primer pair, For other samples, e.g., 
Ikaros, 25 cycles were done and for Helios and Aiolos 28 cycles were done. 

The subsets of hemo-lymphoid populations used for these studies and their ontogeny 
are diagrammed in Figure 2. Stem cell population (ckit + Sca-1 + , lineage"), early progenitors 
(ckit + Sca-1', lineage" and ckit + SCA-rSca2 + , lineage") were purified from the bone marrow 
of wild type mice . Lineage committed erythroid (lerl 19 + ), pre B (B220 + ), granulocyte 
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(Mac] + , GR*) monocyte/macrophage (Macl + , GR* populations were purified form bone 
marrow of wild type mice using antibodies to cell surface markers, magnetic secondary 
antibodies and separated using a MACS magnetic separation column. Pro B cells were 
B220 + sorted from the bone marrow of Rag-/- mice, mature B cells were B220 + form the 
spleens of wild type mice. Splenocyte form -Rag-/- mice were depleted of red cells and 
used an enriched source of NK Thymic and splenic dendritic cells were purified. Double 
positive (CD4 + CD8 + ) and single positive (CD4 + or CD8 + ) were sorted form wild type 
thymus and soluble negative (CD4 CD8) were obtained form thymocyte of Rag-/- mice that 
are arrested at this state of differentiation. Developmental stages of double thymocyte 
(CD4 10 , ckit + CD25 + , ckit-CD25 + , ckit, CD25) were sorted to 98-99% purity. 

Helios mRNA was detected in the bone marrow progenitor population that was 
highly enriched for stem cell activity (ckit + /Sca-l+lineage-) and was also present in 
hematopoietic progenitors with more restricted lymphoid or erythro-myeloid potential 
(ckit+/Sca-l-r/Sca-2+ and ckit+/Sca-l-/Sca-2- respectively). Ikaros displays a similar 
pattern of expression in these hematopoietic progenitor populations whereas Aiolos was 
detected only in the progenitors that were more committed to lymphoid development 
(ckit+/Sca-l+/Sca-2+). 

Helios was expressed in definitive erythroid precursors (terl 19+) and very low 
levels of Helios mRNA are present within the monocyte (macl+GR-) and granulocyte 
(Macl+GR+) population sin the adult bone marrow. Ikaros, but not Aiolos, was detected at 
low levels in all three of these cell types. Helios was present at low levels in pro-B cells 
(CD45R+/CD43+), and decreases as they progress to pre-B cells (CD45R+/IgM+). In 
contrast, Aiolos expression was low in pro-B cells and dramatically increases in pre-B and 
mature B cells. 

As HSCs differentiate along the myeloerythroid and B lymphoid lineages, Helios 
expression was diminished. However, Helios was present at varying levels in all T cell 
subsets analyzed. The earliest lymphoid progenitors entering the thymus are CD4 10 (and 
ckit+) and are not necessarily committed to the T cell lineage. Helios and Ikaros are both 
detected in these earliest lymphoid progenitors. An increase in Helios was apparent during 
the progressive transition to the ckit+CD25+, and then ckit-CD25+, where Aiolos was first 
detected. A marked increase in Aiolos levels was observed at the next stage (ckit-CD25-), 
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while Helios expression decreases. Ikaros levels remain constant during these early stages 
of T-cell differentiation. For comparison, the expression of these genes in RNA from total 
thymocyte populations of wild type and Rag-/- mice was done. In Rag-/- mice the majority 
of thymocytes are at the ckit4-CD25+ stage where T cell development was blocked, while in 

5 a wild type thymus, the majority of cells are at the later, double positive (CD4+CD8+) 
stage. Helios mRNA increases as T cells progress from the CD4-CD8- double negative to 
the CD4+CD8+ double positive stage and declines as these become single positive (CD4+ 
or CD8+) thymocytes. Peripheral T cells have lower expression of Helios than immature 
thymocytes with the highest levels detected among y5T cells of the skin (Vy8) and the gut 

10 (IEL). Ikaros and Aiolos are present in these T cell populations but Aiolos was not detected 
in the fetally derived skin y5 T cells. All three genes are expressed in NK cells. The 
lymphoid derived thymic dendritic cells (DC) as well the splenic CD8+ and CD8- dendritic 
subsets express very low levels of Helios. Ikaros was present in all three populations, but 
was highest in the splenic CD8-DC subset. Among the DC subpopulations, Aiolos was also 

15 highest in the splenic DC8-DCs. 

The expression of two Helios isoforms was routinely detected by PCR using a 5' 

primer preceding the first zinc finger. These isoforms correspond to the Hel-1 and Hel-2 

cDNAs and are expressed at roughly equivalent levels in all cell types tested. As described 
previously, no significant difference in the ratio of Ikaros isoforms in different 
20 hematopoietic populations can be detected under our conditions, where amplification was 
determined to be within the linear range. In all hematopoietic cell types analyzed, Ik-1 an 
Ik-2 were expressed in highest abundance, while Ik-4 and 5 were expressed at low levels. A 
faint band corresponding to Ik-6 was also detected in all populations tested. 

During hematopoietic development, Helios, Ikaros and Aiolos have overlapping but 
25 distinct patterns of expression. The differential patterns of expression of these three factors 
within the hematopoietic system may underscore their specific regulatory roles during 
differentiation. 

The expression of Helios at the sites where HSCs arise suggest that this gene is an 
important regulators of the earlier stages of hematopoietic development. Hematopoietic 
30 progenitors accumulate in the yolk sac at day 8 and the fetal liver in day 1 1 . Both Ikaros 
and Helios are expressed in similar numbers of cells in these regions at early states. As 
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gestation continues these sites are increasingly populated by more committed erythroid 
progenitors as well. While Ikaros expression increases in both sites, Helios was only 
expressed in a limited number of cells. This may reflect its preferential expression in the 
less committed hematopoietic progenitors in the embryo. The expression of Helios in 
sorted hematopoietic cells in the adult supports this interpretation. Helios is expressed in 
adult HSCs but its expression decreases in the maturing erythroid, macrophage and B- 
lymphocyte lineages. Helios expression peaks in the early stages of T-cell development and 
decrease as T cells mature in the thymus and are exported to the periphery. Significant 
levels of Helios are maintained in only a small subset of mature T-cells. Upon 
immunization, Helios is detected in a very small number of cells in germinal centers of the 
spleen. When compared with that of Ikaros and Aiolos, this profile of Helios expression 

suggests that transcriptional complexes including Ikaros and Helios will predominate in the 
earlier stages of hematopoiesis. This combination may be important for the self-renewing 
capacity of early progenitors that is compromised in the Ikaros DN homozygous mice. The 
increasing expression of Aiolos and Ikaros as development proceeds may lead to complexes 
that promote lineage progression and differentiation. 

While Ikaros and Aiolos are predominately expressed in the hematopoietic system, 
Helios is also expressed elsewhere in the embryo. Based on this observation, it likely that 
the Ikaros gene family regulates lineage progression on other tissues as will. The dynamic 
expression of Helios in the embryo is consistent with such a role. Mutational analysis of the 

Helios gene will help to dissect its role in regulating progenitor development in the 

hematopoietic system and elsewhere in the embryo. 

Helios forms homodimers and heterdimerizes with Ikaros and Aiolos 

The C-terminal zinc fingers of Ikaros and Aiolos, shown to mediate their homo- and 
heterodimerization, are highly conserved in Helios. Helios-specific polyclonal antibodies 
were generated to study the interactions between the Helios isoforms and the Ikaros and 
Aiolos proteins. Generation of Helios-specific polyclonal antibodies was carried out as 
follows. The region between the N- and C-terminal zinc fingers of Helios was amplified by 
PCR and cloned in frame into a pRSET vector (Invitrogen, Carlsbad, CA). The protein was 
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expressed in BL21 E.coli and denatured protein was purified on a nickel affinity column as 
recommended by the manufacturer (Invitrogen, Carlsbad, CA). Rabbit polyclonal 
antibodies raised to this protein were affinity purified by pH elution. Specificity of this 
antibody for Helios and not other Ikaros homologs was confirmed by Western blot analysis 
of protein extracts from transfected 293T cells and by immunofluorescence of transfected 
cells. For Western analysis, protein lysates were taken up in IX Laemmli sample buffer, 
heated at 95°C for 15 minutes and resolved on a 10% SDS-PAGE gel. Resolved proteins 
were transferred to an Immobilon-P membrane that was probed with the affinity purified 
polyclonal Helios antibodies (1/500 dilution in PBS, 0.05% TWEEN-20). To detect Helios 
in primary cells, signal was amplified by incubation of the filter with a 1/5000 dilution of 
biotinylated Goat a-rabbit antibody followed by the same dilution of peroxidase coupled 
streptavidin (Jackson labs). The ECL kit (Amersham, Uppsala, Sweden) was used for 

detection. 

The antibody generated against Helios recognized the two Helios isoforms in 
thymocyte nuclear extracts from wild type, Ikaros null, and Ikaros DN+/- mutant mice. The 
Helios isoforms detected in thymocytes were approximately 64 and 68 kDa, and co- 
migrated with the proteins produced by the Hel-1 and Hel-2 cDNAs when co-expressed in 

the epithelial cell line 293T . 

To determine whether Helios physically interacts with Ikaros in primary cells, we 
used cell lysates from the thymuses of mice transgenic for an epitope-tagged (FLAG) 
tagged Ik-7 isoform expressed from the CD2 minigene. Ik-7 was the predominant isoform 
produced by the Ikaros DN mutant locus and lacks the DNA binding domain, but has intact 
C-terminal dimerization zinc finger motifs. Complexes were immunoprecipitated from 
thymic whole cell lysates using a mouse monoclonal antibody specific for the FLAG 
epitope. Western blot analysis using the Helios polyclonal antibody revealed the presence 
of both Helios isoforms in the immunoprecipitated complexes. Thus, the Ikaros DN protein 
formed a stable protein complex with Helios protein isoforms and may interfere with their 

normal activity in vivo. 

To examine more closely the ability of Helios isoforms Hel-1 and Hel-2 to form 
dimers with self, as well as with Ikaros and Aiolos, these factors were transiently expressed 
in 293 T cells in pairwise combinations. Transient expression of Ikaros and Aiolos in 293T 
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cells was carried out as follows. Full length Hel-1 or the Hel-2 isoforms were amplified by 
PCR from thymocyte cDNA using primers generated to the 5 ! or 3 1 ends (5 ! 
AATTGAATTCATGCACTGCACTTTGACTATGG (SEQ ID NO: 18) and 3 r R: 
TTTTCCTTTTGCGGCCGCATGTCGCCATCCGAGGGAAGG SEQ ID NO: 19) and 

5 cloned into the CDM8 mammalian expression vector between the EcoRI and Notl sites 
(CDM8-Hel-1, CDM8-Hel-2). Additional constructs were generated having the FLAG or 
hemagglutanim (HA) tags (FLAG-Hel-1, FLAG-HE 1-2, HA-HE1, HA-Hel-2). The clones 
were sequenced to confirm no mutations were introduced and that they were in frame with 
epitope tags. 293T cells were transfected with 10p,g of each cDNA. After two days, cells 

10 from each 10 cm plate were harvested in 0.5 ml lysis buffer, 10 jLtl of extract was used to 
confirm expression of each protein by Western blot analysis, and 100 \il of extract 
precleared with protein G-agarose followed by immunoprecipitation with anti-FLAG M5 
affinity gel. After washing, beads were resuspended in 2X Laemmli sample buffer and 
incubated for 15 minutes at 95°C. The beads were spun down and one third of the 

15 supernatant was resolved on a 10% SDS-PAGE gel. Western blot analysis was carried out 
as described above except that for 293T extracts, incubation with affinity purified 
polyclonal antibodies specific for Ikaros or Helios was followed by incubation with 
peroxidase coupled Goat-a-rabbit secondary antibody. For immunoprecipitation from 
primary cells, thymocyte or splenocyte were obtained for transgenic mice expressing the 

20 FLAG-tagged dominant negative mutant Ikaros isoform Ik-7 from the CD2 minigene. Cells 
were harvested and washed in PBS/2% FCS. Protein extracts were prepared by lysis of 
1X1 0 7 cells per 100 jil lysis buffer. 

As mentioned above, to determine whether Helios isoforms Hel-1 and Hel-2 can 
form dimers with self, as well as with Ikaros and Aiolos, these factors were transiently 

25 expressed in 293 T cells in pairwise combinations. One protein in each expressed pair was 
epitope tagged (FLAG). After two days, cell lysates were prepared and Western blot 
analysis confirmed protein expression using antibodies specific for each of the Helios, 
Ikaros and Aiolos proteins. An antibody to the epitope tag (anti-FLAG) was used to 
immunoprecipitate complexes from 293T cell lysates, and precipitated complexes were 

30 analyzed for protein interactions using Ikaros or Helios specific antibodies. The anti FLAG 
antibody co-precipitates both FLAG-Hel-1 and Hel-2, demonstrating that the two isoforms 
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can dimerize. A similar strategy was used to study Helios, Ikaros and Aiolos interactions. 
FLAG-Hel-1 or FLAG-Hel-2 were co-expressed with Ik-1 . The anti-FLAG antibody 
brought down IK-1 in an immunoprecipitated complex in both cases. To control for the 
specificity of the Helios/Ikaros protein interactions, the IkMl (Ik-1 mutant) was also used in 
these assays. IkMl encodes two point mutations in the C-terminal zinc fingers of Ikaros 
that disrupt the ability to dimerize. In contrast to Ik-1, this dimerization deficient form of 
Ikaros was unable to interact with either Helios isoform. Finally, cells were co-transfected 

with FLAG- Aiolos and either Hel-1 or Hel-2 to show that each Helios isoform can form 
heterodimers with Aiolos.. These studies show that the C-terminal zinc fingers in Helios, 
Ikaros and Aiolos are functionally conserved and mediate the stable interactions between 
these proteins which may be critical for hematopoiesis as well as lymphocyte differentiation 
and function. 

Helios is part of a higher order nuclear structure that contains Ikaros and Aiolos 

Our studies with Ikaros and Aiolos have shown that these proteins are part of a 
higher order structure in resting lymphocytes that undergoes dramatic changes upon 
activation. To determine whether Helios also participates in these nuclear macromoiecular 
structures we examined its subcellular localization in primary lymphoid cells by confocal 
immunofluorescence microscopy. Primary thymocyte or splenocyte were obtained. 
Thymocyte were activated for 40 hours on plated precoated with 20 jag/ml CD3. Cells were 
harvested and washed in phosphate buffered saline (PBS). 1X10 5 cells were cytospun per 
slide and fixed 4% paraformaldehyde, 0.5% TWEEN in PBS at 4°C and then washed in 
PBS. Prior to antibody incubation, cells were blocked for 1 hour in 3%BSA, 1% goat 
serum, 1% donkey serum in PBS. Slides were then incubated with 1/50 dilution of primary 
affinity purified a Helios antibody in blocking buffer overnight at 4°C ? followed by a 60 
min. incubation at room temperature with Sng/jxl biotinylated goat a-rabbit IgG followed 
by a 60 min. incubation at room temperature with 5ng/jal biotinylated goat a rabbit IgG 
(Jackson labs). Each antibody incubation step was followed by 3 washes in PBS. A 45 

min. incubation with 5ng/l|ul avidin-FITC (Southern Biotechnology Associates) in 1% 
dialyzed FCS/3%BSA in PBS was done for detection. For double staining, an overnight 
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4°C incubation with affinity polyclonal Aiolos directly couple to the Alexa 568 flourophore 
(Molecular Probes) was done as the final step. For triple staining of Helios, Aiolos and the 
FLAG tagged IK-7 in cells from transgenic mice, were additionally incubated for 60 
minutes at RT with 5ng/l]Lil of an anti-FLAG M5 monoclonal antibody (Kodak, washed 
and then incubated for 60 minutes with a 5ng/|il Cy5 coupled goat anti-mouse antibody. 
Specific staining was visualized by confocal immunofluorescence microscopy . 

Resting or activated thymocytes and splenocytes isolated from wild-type mice were 
prepared for these confocal studies. In contrast to Ikaros and Aiolos, bright staining for 
Helios was detected only in a small number of either resting or activated thymocytes 
(approximately 1 in 25 cells). In these few cells, Helios was detected in a punctate pattern 
within the nucleus, similar to that previously described for Ikaros and Aiolos . Upon 
thymocyte activation, Helios was redistributed into ring like structures in the nucleus, as are 
Ikaros and Aiolos. Helios was also detected in a very small number of splenocytes. The 
cells are likely to be T or NK cells, as RT-PCR analysis indicated that Helios was not 
expressed at significant levels in mature B cells, myeloid or erythroid cells. 

To determine potential co-localization of these proteins in higher order structures, 
splenocytes were double stained for Aiolos and Helios. Although most cells in the spleen 
express Aiolos, a small subset of splenocytes express Helios as well. In most cases, there is 
complete overlap of these two proteins in a punctate pattern with the nucleus. However, 
there are a few small spots where either Helios or Aiolos is detected alone. In addition, a 
few cells were observed that showed bright staining for Helios, but only faint staining for 
Aiolos. Cells stained for Ikaros and Helios showed a similar co-localization of the proteins. 

To further investigate the nuclear localization of Helios with Ikaros and Aiolos, T 
cells from the spleen of mice expressing the FLAG-Ik-7 transgene were used. The FLAG- 
epitope was utilized in triple staining studies to examine the localization of Ik-7 with the 
endogenous Helios and Aiolos proteins. In cells of young animals, these three proteins co- 
localize within nuclear structures, similar to that observed in wild type cells. 

These studies establish the presence of all three family members in the same 
structures within the nucleus and demonstrate that Ikaros DN mutant proteins have the 
potential to interfere with the activity of the endogenous Helios and Aiolos proteins by co- 
localization within the same macromolecular nuclear structures. As inferred from the 
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expression profiles of sorted cells, this immunofluorescence data also confirms the co- 
expression of different Ikaros family proteins in varying combinations within cells of 
distinct sub-populations in the thymus and spleen 

Helios can function as Transcriptional Activator 

Ikaros and Aiolos have been shown to function as positive transcriptional regulators 
upon ectopic expression in mammalian cells . The transcriptional activation domains of 
both proteins were identified using yeast one hybrid assays, and they were found to function 
similarly in mammalian cells. Helios protein exhibits conservation to the transcriptional 
activation domain of Ikaros and Aiolos. Given the near identity in the DNA binding 
domain between Helios and Ikaros, we tested the ability of Helios to activate transcription 
from Ikaros binding sties. The expression of a reporter gene under the control of four high 
affinity Ikaros binding sites (IkBS2) was tested in the presence of Helios or Ikaros in 
NIH3T3 cells. Both proteins were shown to increase expression of the reporter gene over 
background levels (Figure 3). A five fold increase was detected in the presence of Helios 
while a 7.8 fold increase was detected in the presence of Ikaros. This transcriptional 
activation mediated by Helios requires the Ikaros consensus binding sites. These results 
confirm the functional conservation of both the DNA binding and transcriptional activation 

domains. 
Helios 

The present invention identifies and characterizes Helios, a new member of the 
Ikaros gene family. The proteins encoded by all three genes in the Ikaros family share 
grossly similar properties mediated by conserved functional domains. All three bind to the 
consensus DNA binding sites characterized for Ikaros and activate transcription form an 
adjacent promoter in co-transfection assays. Like Aliolos and Ikaros, Helios can dimerize 
with itself as well as other family members including a dominant negative isoform of 
Ikaros. Although the conservation of these domains emphasizes the similarity of these 
proteins, other regions differ between the proteins encoded by these genes and may confer 
functional specificity among them. The fact that the regions that diverge between family 
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members are conserved in the orthologues of these genes in other species supports their 

functional significance. 

The preferential expression of Helios in the earliest stages of the hemopoietic 
lineages suggests that gene may exert its predominant function in early progenitor cells. 
The facts that a dominant negative Ikaros protein causes defects in the HSC and that Helios 
is the only identified target of this protein expressed at this stage of the lineage imply a 
crucial role for Helios in HSC development. The expression of Helios outside the 
hemopoietic system may indicate a role for the Ikaros gene family in progenitor 
development in other tissues as well 

Equivalents 

Those skilled in the art will be able to recognize, or be able to ascertain using no 
more than routine experimentation, numerous equivalents to the specific procedures 

described herein. Such equivalents are considered to be within the scope of this invention 

and are covered by the following claims. 
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Claims: 

1 . A substantially pure nucleic acid comprising a nucleotide sequence which is at 
least 60% identical to the nucleotide sequence of SEQ ID NO:l, 3, or 5. 

2. The nucleic acid of claim 1 comprising the nucleotide sequence of SEQ ID NO:l, 

3 or 5. 

3. The nucleic acid of claim 1, wherein the nucleic acid hybridizes under high 
stringency conditions to a nucleic acid which encodes a Helios polypeptide, or a 
complement thereof. 

4. The nucleic acid of claim 1, wherein the nucleic acid encodes a polypeptide that 
reacts with an antibody specific for a Helios polypeptide. 

5. A substantially pure nucleic acid which encodes a fragment of a Helios 
polypeptide of at least 60 amino acids in length. 

6. A substantially pure polypeptide which is at least 60% homologous to an Helios 
polypeptide. 

7. The Helios polypeptide of claim 6 comprising the amino acid sequence of SEQ 
ID NO:2, 4, or 6. 

8. The pure preparation of claim 6, wherein the Helios polypeptide has the 
following properties: 

(a) it can form a dimer with an Helios, Aiolos, or Ikaros polypeptide; 

(b) it is expressed in hematopoietic stem cells; 

(c) it has a molecular weight of approximately 64 kDa or 68 KDa; 

(d) it has at least one zinc finger domain; and 

(e) it is a transcriptional activator of a lymphoid gene. 
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9. A fragment of the Helios polypeptide of claim 8 which is at least 50 amino 
acids in length. 

10. A vector comprising the nucleic acid of any of claims 1 -5 . 

11. A cell comprising the nucleic acid of any of claims 1-5 and 10. . 

12. A purified preparation of an anti-Helios antibody. 

13. A method for manufacture of an Helios peptide comprising culturing the cell 
of claim 1 1 in a medium to express the Helios polypeptide. 

14. A method of making an Helios polypeptide, having at least one biological 
activity of a naturally occurring Helios polypeptide including altering the sequence, of one 
or more residues and testing the altered polypeptide for the desired activity. 

15. A method for treating an animal for a disorder comprising administering a 
therapeutically-effective amount of an Helios polypeptide, a cell selected for the expression 
of a product of the Helios gene, or a nucleic acid encoding an Helios peptide to the animaL 

16. A method for determining if a subject is at risk for a disorder related to mis- 
expression of the Helios gene comprising examining the subject for the expression or 
structure of the Helios gene, non-wildtype structure or expression being indicative of risk. 

17. A transgenic animal having an Helios transgene. 
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Abstract 

The invention features an Helios polypeptide. 
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ATGGAAACAGACGCTATTGATGGCTATATAACATGTGACAATGAGCTTTCACCCGAAGGGGAACACGCCA 
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TACCTTTGTCTGCGATAACTACCGATATATTGTACACTGTTACTCGAAAGTGGGCTTCCCCTTGTGCGGT 
METDAIDGYITCDNELSPEGEHA 

ATATGGCCATTGACCTCACCTCAAGCACGCCCAATGGACAGCACGCCTCGCCAAGTCACATGACAAGCAC 
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TATACCGGTAACTGGAGTGGAGTTCGTGCGGGTTACCTGTCGTGCGGAGCGGTTCAGTGTACTGTTCGTG 
N ft A I OLTSSTPNGQHASPSHMTST 

AAATTCTGTAAAGCTGGAAATGCAGAGTGATGAAGAGTGTGACAGGCAGCCCCTGAGCCGTGAGGATGAG 
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TTTAAGACATTTCGACCTTTACGTCTCACTACTTCTCACACTGTCCGTCGGGGACTCGGCACTCCTACTC 
NSVKLEMQSOEECDRQPLSREDE 

ATCAGGGGCCACGATGAGGGGAGCAGCCTAGAAGAACCCCTAATTGAGAGCAGCGAGGTGGCCGACAACA 
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TAGTCCCCGGTGCTACTCCCCTCGTCGGATCTTCTTGGGGATTAACTCTCGTCGCTCCACCGGCTGTTGT 
IRGHDEGSSLEEPLIESSEVADN 

GGAAAGTCCAGGACCTTCAAGGCGAGGGAGGAATCCGGCTTCCGAATGGTAAACTGAAATGTGACGTCTG 
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CCTTTCAGGTCCTGGAAGTTCCGCTCCCTCCTTAGGCCGAAGGCTTACCATTTGACTTTACACTGCAGAC 
RKVQDLQGEGG I RLPNGKLKCDVC 

TGGCATGGTTTGCATTGGGCCCAATGTGCTTATGGTACATAAAAGGAGTCACACTGGTGAGCGGCCCTTC 
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ACCGTACCAAACGTAACCCGGGTTACACGAATACCATGTATTTTCCTCAGTGTGACCACTCGCCGGGAAG 
GMVC I GPNVLMVHKRSHTGERPF 

CACTGTAACCAGTGCGGAGCTTCTTTTACCCAGAAGGGCAACCTTCTGAGACAEATAAAGTTACACTCTG 
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GTGACATTGGTCACGCCTCGAAGAAAATGGGTCTTCCCGTTGGAAGACTCTGTGTATTTCAATGTGAGAC 
HCNQCGASFTQKGNLLRH I KLHS 

GAGAGAAGCCCTTCAAATGTCCTTTCTGTAGCTATGCTTGTAGAAGAAGGGACGCTCTCACAGGACACCT 
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CTCTCTTCGGGAAGTTTACAGGAAAGACATCGATACGAACATCTTCTTCCCTGCGAGAGTGTCCTGTGGA 

* 

GEKPFKCPFCSYACRRRDALTGHL 

CAGGACCCATTCTGTGGGTAAACCTCACAAGTGTAACTACTGTGGCCGAAGCTACAAGCAGCGCACGTCA 
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GTCCTGGGTAAGACACCCATTTGGAGTGTTCACATTGATGACACCGGCTTCGATGTTCGTCGCGTGCAGT 
RTHSVGKPHKCNYCGRSYKQRTS 
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GACCTCCTTGTGTTCCTTGCGACAGTGTTGATAGAGGTCTTACAGTCGTACCTCCSACSGCCCGTCCAGT 
LEEHKERCHNYLQ NVS M EAAGQV 

TGAGTCACCATGrACCGCCTATGGAAGATTGTAAGGAACAAGAGCCTATCATGGACAACAAT ATTTCTCT 

.... I , , [ ■ - • i ■ ■ ■ | ■ | .,, i .... | . . .. i . .. i I p . i i ■ i ■ ■ ■ ■ [ 77Q 

ACTCAGTGGTACATGGCGGATACCTTCTAACATTCCTTGTTCTCGGATAGTACCTGTTGTTATAAAGAGA 

MSHHVPPMEDCKEQEPIMDNNISL 
CRTRCCTTTTGAGAGACCTGCTGTCATAGAGAAGCTCACGGCAAATATGGGAAAG CGCAAAAGCTCCACT 
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CCACGGAAAACTCTCTGGACGACAGTATCTCTTCGAGTGCCGTTTATACCCTTTCGCGTTTTCGAGGTGA 
VP F E R P AV I EKLTANMGKR KS.S T 

CCTCAGAAGTTTGTGGGGGAAAAGCTTATGCGATTCAGCTACCCAGATATTCATTTTGATATGAACTTAA 
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GGAGTCTTCAAACACCCCCTTTTCGAATACGCTAAGTCGATGGGTCTATAAGTAAAACTATACTTGAATT 

PQKFVGEKLMRFSYPDIHFDMNL 
CATATGAGAAGGAGGCTGAGCTGATGCAGTCTCATATGATGGACCAAGCCATCAACAATGCAATCACCTA 
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GTATACTCTTCCTCCGACTCGACTACGTCAGAGTATACTACCTGGTTCGGTAGTTGTTACGTTAGTGGAT 
TYEKEAELMQSHMMDQA INMA I T Y 

1050 

GGAACCTCGACTCCGGGAAGTGGGAGACTACGTCGTACGTGGTTCGTGTTAGCGACTCCACCGGGGTCAA 
LGAEALHPLMQHAPSTIAEVAPV 

ATAAGCTCAGCTTATT CTCAGGTCTATCATCCAAACAGGATAGAAAGACCCATTAGCA6GGAAACATCTG 

TATTCGAGTCGAATAAGAGTCCAGATAGTAGGTTTGTCCTATCTTTCTGGGTAATCGTCCCTTTGTAGAC 
ISSAYSQVYHPNRIERPISRETS 

AT&GTCACGAAAACAACATGGATGGCCCCATCTCTCTCATCAGACCAAAGAGTCGACCCCAGGAAAGAGA 
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TATCAGTGCTTTTGTTGTACCTACCGGGGTAGAGAGAGTAGTCTGGTTTCTCAGCTGG6GTCCTTTCTCT 

DSHENNH0GP1SLIRPKSRPQERE 

GGCCTCGCCCAGCAATAGCTGCCTCGATTCTACTGACTCAGAAAGTAGCCATGATGACCGCCAGTCCTAC 

CCGGAGCGGGTCGTTATCGACGGAGCTAAGATGACTGAGTCTTTCATCGGTACTACTGGCGGTCAGGATG 
ASPSNSCLDSTDSESSHDDRQSY 
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CAAGGAAACCCTGCCTTAAATCCCAAGAGGAAACAAAGCCCAGCTTACATGAAGGAGGATGTCAAGGCTT 
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GTTCCTTTGGGACGGAATTTAGGGTTCTCCTTTGTTTCGGGTCGAATGTACTTCCTCCTACAGTTCCGAA 
q g N P A L N P K R K Q S P A Y H" K E D V K A 

TGGATGCTACCAAGGCCCCCAAGGGCTCTCTGAAGGACATCTATAAGGTTTTCAATGGAGAAGGAGAACA ... 
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ACCTACGATGGTTCCGGGGGTTCCCGAGAGACTTCCTGTAGATATTCCAAAAGTTACCTCTTCCTCTTGT 
LDATKAPKGSLKD I YKVFNGEGEQ 

GATAAGGGCCTTCAAGTGTGAGCACTGCCGAGTCCTTTTTCTAGACCATGTCATGTACACCATTCACATG 
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CTATTCCCGGAAGTTCACACTCGTGACGGCTCAGGAAAAAGATCTGGTACAGTACATGTGGTAAGTGTAC 

I RAFKCEHCRVLFLDHVMYT I H M 

GGTTGCCATGGCTACCGGGACCCACTGGAATGCAACATCTGTGGCTACAGAAGCCAGGACCGCTACGAAT 
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CCAACGGTACCGATGGCCCTGGGTGACCTTACGTTGTAGACACCGATGTCTTCGGTCCTGGCGATGCTTA 

GCHGYRD PLECNI CGYRSQDRYE 
TTTCATCACACATTGTTGGGGGGCAGCACACATTCCACTAGGCGTTTGCATTCCAAGG 

AAAGTAGTGTGTAACAACCCCCCGTCGTGTGTAAGGTGATCCGCAAACGTAAGGTTCC 
PSSH1 VGGQHTFH. AFAFQG 
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ATGGAAACAGACGCTATTGATGGCTATATAACATGTGACAATGAGCTTTCACCCGAAGGGGAACACGCCA 
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TACCTTTGTCTGCGATAACTACCGATATATTGTACACTGTTACTCGAAAGTGGGCTTCCCCTTGTGCGGT 
METDAIDGYI TCONE LSPEGEHA 

ATATGGCCATTGACCTCACCTCAAGCACGCCCAATGGACAGCACGCCTCGCCAAGTCACATGACAAGCAC 

TATACCGGTAACTGGAGTGGAGTTCGTGCGGGTTACCTGTCGTGCGGAGCGGTTCAGTGTACTGTTCGTG 
N M A I'DLTSSTPNGQHASPSHMTST 

AAATTCTGTAAAGCTGGAAATGCAGAGTGATGAAGAGTGTGACAGGCAGCCCCTGAGCCGTGAGGATGAG 
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TTTAAGACATTTCGACCTTTACGTCTCACTACTTCTCACACTGTCCGTCGGGGACTCGGCACTCCTACTC 
NSVKLEMQSOEECDRQPLSREDE 

ATCAGEGGCCACGATGAGGGGAGCAGCCTAGAAGAACCCCTAATTGAGAGCAGCGAGGTGGCCGACAACA 
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ITAGTCCCCGGTGCTACTCCCCTCGTCGGATCTTCTTGGGGATTAACTCTCGTCGCTCCACCGGCTGTTGT 

II RGHDEGSSLEEPLIESSEVAON 

i 

\ GGAAAGTCCAGGACCTTCAAGGCGAGGGAGGAATCCGGCTTCCGAATGGTGAGCGGCCCTTCCACTGTAA 
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CCTTTCAGGTCCTGGAAGTTCCGCTCCCTCCTTAGGCCGAAGGCTTACCACTCGCCGGGAAGGTGACATT 
RKVQDLQGEGG IRLPNGERPFHCN 

CCAGTGCGGAGCTTCTTTTACCCAGAAGGGCAACCTTCTGAGACACATAAAGTTACACTCTGGAGAGAAG 
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GGTCACGCCTCGAAGAAAATGGGTCTTCCCGTTGGAAGACTCTGTGTATTTCAATGTGAGACCTCTCTTC 
QCGASFT QKGNLLRH1 KLHSGEK 

CCCTTCAAATGTCCTTTCTGTAGCTATGCTTGTAGAAGAAGGGACGCTCTCACAGGACACCTCAGGACCC 
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GGGAAGTTTACAGGAAAGACATCGATACGAACATCTTCTTCCCTGCGAGAGTGTCCTGTGGAGTCCTGGG 
PFKCPFCSYACRRRDALTGHLRT 

ATTCTGTGGGTAAACCTCACAAGTGTAACTACTGTGGCCGAAGCTACAAGCAGCGCACGTCACTGGAGGA 
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TAAGACACCCATTTGGAGTGTTCACATTGATGACACCGGCTTCGATGTTCGTCGCGTGCAGTGACCTCCT 
HSYGKPHKCNYCGRSYKQRTSLEE 

ACACAAGGAACGCTGTCACAACTATCTCCAGAATGTCAGCATGGAGGCTGCCGGGCAGGTCATGAGTCAC 
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TGTGTTCCTTGCGACAGTGTTGATAGAGGTCTTACAGTCGTACCTCCGACGGCCCGTCCAGTACTCAGTG 
HKERCHNYLQNVSMEAAGQVMSH 
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CATGTACCGCCTATGGAAGATTGTAAQGAACAAGAGCCTATCATCGACAACAATATTTCTCTGGTGCCTT 
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GTACATGGCGGATACCTTCTAACATTCCTTGTTCTCGGATAGTACCTGTTGTTATAAAGAGACCACGGAA 
HVPPMEDCKEQEP IMDN N I SLVP 

TTGAGAGACCTGCTGTCATAGAGAAGCTCACGGCAAATATGGGAAAGCGCAAAAGCTCCACTCCTCAGAA _ - 
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AACTCTCTGGACGACAGTATCTCTTCGAGTGCCGTTTATACCCTTTCGCGTTTTCGAGGTGAGGAGTCTT 
FERPAV I EKLTANMGKRKSSTPQK 

GTTTGTGGGGGAAAAGCTTATGCGATTCAGCTACCCAGATATTCATTTTGATATGAACTTAACATATGAG 
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CAAACACCCCCTTTTCGAATACGCTAAGTCGATGGGTCTATAAGTAAAACTATACTTGAATTGTATACTC 
FVGEKLM RFSYPDIHFDMNLTYE 

AAGGAGGCTGAGCTGATGCAGTCTCATATGATGGACCAAGCCATCAACAATGCAATCACCTACCTTGGAG 
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TTCCTCCGACTCGACTACGTCAGAGTATACTACCTGGTTCGGTAGTTGTTACGTTAGTGGATGGAACCTC 
KEAELHQSHMNDO A I NNA I TY L G 

CTGAGGCCCTTCACCCTCTGATGCAGCATGCACCAAGCACAATCGCTGAGGTGGCCCCAGTTATAAGCTC 
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GACTCCGGGAAGTGGGAGACTACGTCGTACGTGGTTCGTGTTAGCGACTCCACCGGGGTCAATATTCGAG 

AEALHPLflGHAPSTIAEYAPV ISS 

AGCTTATTCTCAGGTCTATCATCCAAACAGGATAGAAAGACCCATTAGCAGGGAAACATCTGATAGTCAC 
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TCGAATAAGAGTCCAGATAGTAGGTTTGTCCTATCTTTCTGGGTAATCGTCCCTTTGTAGACTATCAGTG 
AYSQVYHPNRIERPISRETSDSH 

GAAAACAACATGGATGGCCCCATCTCTCTCATCAGACCAAAGAGTCGACCCCAGGAAAGAGAGGCCTCGC 
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CTTTTGTTGTACCTACCGGGGTAGAGAGAGTAGTCTGGTTTCTCAGCTGGGGTCCTTTCTCTCCGGAGCG 

ENNHDGP I SL I RPKSRPQEREAS 

CCAGCAATAGCTGCCTCGATTCTACTGACTCAGAAAGTAGCCATGATGACCGCCAGTCCTACCAAGGAAA 

GGTCGTTATCGACGGAGCTAAGATGACTGAGTCTTTCATCGGTACTACTGGCGGTCAGGATGGTTCCTTT 
PSNSCLDSTDSESSHDDRQSY QGN 

CCCTGCCTTAAATCCCAAGAGGAAACAAAGCCCAGCTTACATGAAGGAGGATGTCAAGGCTTTGGATGCT 
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GGGACGGAATTTAGGGTTCTCCTTTGTTTCGGGTCGAATGTACTTCCTCCTACAGTTCCGAAACCTACGA 
PALNPKRKQSPAYMKEDVKALDA 
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ACCAAGGCCCCCAAGGGCTCTCTGAAGGACATCTATAAGGTTTTCAATGGAGAAGGAGAACAGATAAGGG 
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TGGTTCCGGGGGTTCCCGAGAGACTTCCTGTAGATATTCCAAAAGTTACCTCTTCCTCTTGTCTATTCCC 
TKAPKGSLKD I YKVFN G E G E Q I R 

CCTTCAAGTGTGAGCACTGCCGAGTCCTTTTTCTAGACCATGTCATGTACACCATTCACAXGGGTTGCCA ... 
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GGAAGTTCACACTCGTGACGGCTCAGGAAAAAGATCTGGTACAGTACATGTGGTAAGTGTACCCAACGGT 
AFKCEHCRVLFLDHVMYT I HMGCH 

TGGCTACCGGGACCCACTGGAATGCAACATCTGTGGCTACAGAAGCCAGGACCGCTACGAATTTTCATCA 
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ACCGATGGCCCTGGGTGACCTTACGTTGTAGACACCGATGTCTTCGGTCCTGGCGATGCTTAAAAGTAGT 
GYRDPLECNICGYRSQDRYEFSS 

CACATTGTTGGGGGGCAGCACACATTCCACTAGGCGTTTGCATTCCAAGG 
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GTGTAACAACCCCCCGTCGTGTGTAAGGTGATCCGCAAACGTAAGGTTCC 
H IVGGQHTFH. AFAFQG 
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1/1 31/11 

GCC CGG GCA GGT CGC ATT GCT ATA GCA CTC ACT GAC CTC TCT CTC TCT CTT TTT TTT CCT 
ARAGRIAIALTDLSLSLFFP 
61/21 91/31 

CTT TCC TGA AAC CCG ACA TTG TCA CCT CCT CTT TGA GGG TTA GAA GAA GCT GAG ATC TCC 
LS*NPTLSPPL*GLESAEIS 
121/41 151/51 

CGA CAG AGC TGG AAA TGG TGA TGA ATC TTT TTT AAT CAA AGG ACA ATT TCT TTT CAT TGC 
RQSWKW* * IFFNQRTI SFHC 
181/61 211/71 

ACT TTG ACT ATG GAA ACA GAG GCT ATT GAT GGC TAT ATA ACG TGT GAC AAT GAG CTT TCA 
TLTMETEAIDGYITCDNELS 
241/81 271/91 

CCC GAA AGG GAG CAC TCC AAT ATG GCA ATT GAC CTC ACC TCA AGC ACA CCC AAT GGA CAG 
PEREHSNMAIDLTSSTPNGQ 
301/101 331/111 

CAT GCC TCA CCA AGT CAC ATG ACA AGC ACA GAT TCA GTA AAG CTA GAA ATG CAG AGT GAT 
HASPSHMTSTDSVKLEMQSD 
361/121 391/131 

GAA GAG TGT GAC AGG AAA CCC CTG AGC CGT GAA GAT GAG ATC AGG GGC CAT GAT GAG GGT 
EECDRKPLSREDEIRGHDEG 
Cfel/141 451/151 

=jgpC AGC CTA GAA GAA CCC CTA ATT GAG AGC AGC GAG GTG GCT GAC AAC AGG GAA GTC CAG 
raSLEEPLIESSEVADNREVQ 
:S81/161 511/171 

'~UaG CTT CAA GGC GAG GGA GGA ATC CGG CTT CCG AAT GGT AAA CTG AAA TGT GAC GTC TGT 

*Slqgeggirlpngklkcdvc 

||41/131 571/191 

fiGGC ATG GTT TGC ATT GGG CCC AAT GTG CTT ATG GTA CAT AAA AGG AGT CAC ACT GGT GAA 
V H MVC IGPNVLMVHKRSHTGE 
^01/201 631/211 

3 CGC CCC TTC CAC TGT AAC CAG TGT GGA GCT TCT TTT ACT CAG AAG GGC AAC CTT CTG AGA 
fjl PFHCNQCGASFTQKGNLLR 
JfeSl/221 691/231 

: ; !&AC ATA AAG TTA CAC TCT GGA GAG AAG CCG TTC AAA TGT CCT TTC TGT AGT CAC GCC TGT 
-iHIKLHSGEKPFKCPFCSHAC 
31721/241 751/251 

; ff\GA AGA AGG GAC GCC CTC ACA GGA TAC CTC AGG ACC CAT TCT GTG GGT AAA CCT CAC AAG 
'St RRDALTGYLRTHSVGKPHK 
'^81/261 811/271 
TGC AAC TAC TGT GGA CGA AGC TAC AAG CAG CGC AGT TCA CTG GAG GAG CAC AAG GAA CGC 
CNYCGRSYKQRSSLEEHKER 
841/281 871/291 

TGC CAC AAC TAT CTC CAG AAT GTC AGC ATG GAG GCT GCT GGG CAG GTC ATG AGT CAC CAT 
CHNYLQNVSMEAAGQVMSHH 
901/301 931/311 

GTA CCT CCT ATG GAA GAT TGT AAG GAA CAA GAG CCT ATT ATG GAC AAC AAT ATT TCT CTG 
VPPMEDCKEQEPIMDNNISL 
961/321 991/331 

GTG CCT TTT GAG AGA CCT GCT GTC ATA GAG AAG CTC ACG GGG AAT ATG GGA AAA CGT AAA 
VPFERPAVI EKLTGNMGKRK 
1021/341 1051/351 

AGC TCC ACT CCA CAA AAG TTT GTG GGG GAA AAG CTC ATG CGA TTC AGC TAC CCA GAT ATT 
SSTPQKFVGEKLMRFSYPDI 
1081/361 1111/371 

CAC TTT GAT ATG AAC TTA ACA TAT GAG AAG GAG GCT GAG CTG ATG CAG TCT CAT ATG ATG 
HFDMNLTYEKEAELMQSHMM 
1141/381 1171/391 

GAC CAA GCC ATC AAC AAT GCA ATC ACC TAC CTT GGA GCT GAG GCC CTT CAC CCT CTG ATG 

DQAINNAITYLGAEALHPLM 
1201/401 1231/411 

CAG CAC CCG CCA AGC ACA ATC GCT GAA GTG GCC CCA GTT ATA AGC TCA GCT TAT TCT CAG 
QHPPSTIAEVAPVISSAYSQ 
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GTC TAT CAT CCA AAT AGG ATA GAA AGA CCC 

VYHPNRIERP 

1321/441 

AAC AAC ATG GAT GGC CCC ATC TCT CTC ATC 

NNMDGPISLI 

1381/461 

GCC TCT CCC AGC AAT AGC TGC CTG GAT TCC 

ASPSNSCLDS 

1441/481 

CAG TCC TAC CAA GGA CAC CCT GCC TTA AAT 

QSYQGHPALN 

1501/501 

AAG GAG GAT GTC AAA GOT TTG GAT ACT ACC 

KEDVKALDTT 

1561/521 

TAC AAG GTC TTC AAT GGG GAA GGA GAA CAG 

YKVFNGEGEQ 

1621/541 

GTC CTT TTC CTA GAC CAT GTC ATG TAC ACC 

VLFLDHVMYT 

1681/561 

CCA CTG GAA TGT AAC ATC TGT GGC TAC AGA 

PLECMICGYR 

1741/531 

ATT GTT CGA GGG GAG CAC ACA TTC CAC TAG 

^JVRGEHTFH* 

Q.801/601 

\;jgTA AAG ACT GCA CAT GAA GAA ATA CTG CAC 
ffV" KTAHEEILH 
J3.861/621 

"tpCT TTT ATT TTT TTA ATA TAA TAC TGG TGA 
-ft>FIFLI*YW* 
hi- 921/641 
f f£TC TGC T 
. «L C 



1291/431 

ATT AGC AGG GAA ACT GCT GAT AGT CAT GAA 

ISRETADSHE 

1351/451 

AGA CCA AAG AGT CGA CCC CAG GAA AGA GAG 

RPKSRPQERE 

1411/471 

ACT GAC TCA GAA AGC AGC CAT GAT GAC CAC 

TDSESSHDDH 

1471/491 

CCC AAG AGG AAA CAA AGC CCA GCT TAC ATG 

PKRKQSPAYM 

1531/511 

AAG GCT CCT AAG GGC TCT CTG AAG GAC ATC 

KAPKGSLKDI 

1591/531 

ATT AGG GCC TTC AAG TGT GAG CAC TGC CGA 

IRAFKCEHCR 

1651/551 

ATT CAC ATG GGT TGC CAT GGC TAC CGG GAC 

IHMGCHGYRD 

1711/571 

AGC CAG GAC CGT TAT GAG TTT TCA TCA CAC 

SQDRYEFSSH 

1771/591 

GCC TTT TCA TTC CAA AGG GGA CCC TAT GAA 
AFSFQRGPYE 

1831/611 

TTA CAA TCC CAC CTT TCC TCA AAT GTT GTA 

LQSHLSSNVV 

1891/631 

TAA TCT TAT TTT GTG GAG CAG TGT CAT TTG 
* SYFVEQCHL 
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1 ATGGAAACAGACGCTATTGATGGCTATATAACATGTGACAATGAGCTTTC 5 0 
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190 ATGGAAACAGAGGCTATTGATGGCTATATAACGTGTGACAATGAGCTTTC 239 

***** 

51 ACCCGAAGGGGAACACGCCAATATGGCCATTGACCTCACCTCAAGCACGC 10 0 
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240 AC C CGAAAGGGAGC ACT C CAATATGGC AATT GAC CTC AC CTCAAGCAC AC 2 89 



101 C CAATGGACAGCAcGC CT CGC C AAGTCACATGACAAGCACAAATTCTGTA 150 
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290 C C AATGGACAGCATGC CTCAC CAAGTCACATGACAAGCACAGATT C AGT A 3 3 9 



151 AAGCTGGAAATGCAGAGTGATGAAGAGTGTGACAGGCAGCCCCTGAGCCG 2 00 
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3 90 TGAAGATGAGATCAGGGGCCATGATGAGGGTAGCAGC CTAGAAGAAC C C C 439 
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251 TAATTGAGAGCAGCGAGGTGGCCGACAACAGGAAAGTCCAGGACCTTCAA 3 00 
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440 TAATTGAGAGCAGCGAGGTGGCTGACAACAGGGAAGTCCAGGAGCTTCAA 4 89 



3 01 GGCGAGGGAGGAATCCGGCTTCCGAATGGTAAACTGAAATGTGACGTCTG 350 
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490 GGCGAGGGAGGAATCCGGCTTCCGAATGGTAAACTGAAATGTGACGTCTG 53 9 



351 TGGCATGGTTTGCATTGGGCCCAATGTGCTTATGGTACATAAAAGGAGTC 400 
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540 TGGCATGGTTTGCATTGGGCCCAATGTGCTTATGGTACATAAAAGGAGTC 589 



401 ACACTGGTGAGCGGCCCTTCCACTGTAACCAGTGCGGAGCTTCTTTTACC 450 
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590 ACACTGGTGAACGCCCCTTCCACTGTAACCAGTGTGGAGCTTCTTTTACT 639 



451 CAGAAGGGCAACCTTCTGAGACACATAAAGTTACACTCTGGAGAGAAGCC 5 00 
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501 CTTCAAATGTCCTTTCTGTAGCTATGCTTGTAGAAGAAGGGACGCTCTCA 550 
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690 GTTCAAATGTCCTTTCTGTAGTCACGC CTGTAGAAGAAGGGACGC CCTCA 73 9 
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740 CAGGATACCTCAGGACCCATTCTGTGGGTAAACCTCACAAGTGCAACTAC 789 
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1 I I M M M 1 M 1 I t f i I M I I I M i M M M M M M M M M M 11 
Ml 1 M II 1 M ! M M I I M M II II M M I M I M II M I ! I I M I I 

84 0 CTGCCACAACTATCTCCAGAATGTCAGCATGGAGGCTGCTGGGCAGGTCA 8 89 

***** 

701 TGAGTCACCATGTACCGCCTATGGAAGATTGTAAGGAACAAGAGCCTATC 750 

I M 1 I! M II I M M ! II M I i M ! M I M M 1 I M M I M M M M I 
1 M t I 1 M M M M M M M 1 M M I I M M M M I I 11 I I II 1 I I M 

890 TGAGT C AC C ATGT AC CTC CTATGGAAGATTGTAAGGAAC AAGAGC CTATT 93 9 

• » • • . 

751 ATGGACAACAATATTTCTCTGGTGCCTTTTGAGAGACCTGCTGTCATAGA 800 

M M II 11 M M I M M M M M M M M I M M M M M M 11 i I 1 j II 
1 I M M II II M I I M 11 M M I M I M M II M M M I II M M M I II 
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940 ATGGACAACAATATTTCTCTGGTGCCTTTTGAGAGACCTGCTGTCATAGA 989 

***** 

801 GAAGCTCACGGCAAATATGGGAAAGCGCAAAAGCTCCACTCCTCAGAAGT 850 

I I 1 I i I l I i I 1 I ! I I i I I I 1 i I II 1 ! I 11 1 t I i 11 I I 1 i I ill! 
I M i M M M f I ! II ) I I I M ! II I I 1 I I I I I t 1 I 1 1 I t ! I 1 11 

990 GAAGCTCAC GGGGAAT ATGGGAAAACGTAAAAGCT C CAC T CCACAAAAGT 103 9 

• » • • • 

851 TTGTGGGGGAAAAGCTTATGCGATTCAGCTAC C CAGATATTCATTTTGAT 90 0 

1 I II I I I 1 M I ! M I I 1 I I I II I I I 1 11 I I I I I 1 1 I I 1 1 1 I I Mill! 
1 I I 1 ! II 1 I I ! I I 1 M M i M ! M f ) II J i 1 ) I I M I I I I ( t Mil!! 

104 0 TTGTGGGGGAAAAGCTCATGCGATTCAGCTACCCAGATATTCACTTTGAT 1089 



901 ATGAACTTAACATATGAGAAGGAGGCTGAGCTGATGCAGTCTCATATGAT 950 
I I I I ! I I I 1 ! I I I I ! I I ! I M I 1 I M I I M I i 1 I I I I i i I 1 i I M I I I I I 

I t II t I t I 1 I I M I M I If [ I I I I I M M M I I i I M M I M M i I I I I J 

1090 ATGAACTTAACATATGAGAAGGAGGCTGAGCTGATGCAGTCTCATATGAT 113 9 



951 GGACCAAGCCATCAACAATGCAATCACCTACCTTGGAGCTGAGGCCCTTC 1000 

I M M ! It II M M II ! I I I i II t I I I I 11 II I I II I 11 1 I I I It I It t I 
I I I M 11 1 I I It M I I M I M I M 1 I M I M It I I M II M M M 1 M 11 

1140 GGACCAAGCCATCAACAATGCAATC ACCTACCTTGGAGCTGAGGC C CTT C 1189 

■ * * • t 

1001 AC C CT CTGATGCAGCATGCAC CAAGC ACAATCGCTGAGGTGGC CCCAGTT 1050 

I I M I M M I M M M I 1 I M M M I I M M M I I 1 I 11 I I I 1 1 11 
1 ! I I 1 11 I 1 I 1 I I 1 I I 1 I 1 l I I I 1 I I I I I 1 I 1 1 I 1 1 I I 1 I I 1 I 1 I t 

1190 ACCCTCTGATGCAGCACCCGCCAAGCACAATCGCTGAAGTGGC CCCAGTT 1239 

* • • ■ * 

1051 ATAAGCT CAGCTTATTCT CAGGT CTATCATCC AAAC AGGATAGAAAGAC C 1100 

M 1 I I I ! I 1 I M I I 1 M I I 1 M I M I M I M I I M I M M M } I II M I 
I I 1 I I I 1 I i t t 1 I I i I I ! 1 I 1 I I I 1 I I I 1 I I I i I 1 M I I I I M 1 I I M 1 

1240 ATAAGCT CAGCTTATTCTCAGGTCTATCATCCAAATAGGATAGAAAGAC C 1289 



1101 CATTAGCAGGGAAAC AT CTGATAGT CACGAAAAC AAC ATGGATGGC CC C A 1150 

I M M I l I 1 t It I I I I 1 I i I I I 1! I I I 1 I 1 1 I I I 1 M i M I I 1 I 
1 I I I 1! I I I i I 1 ,111111111 I I i I I I I M i I I 1 I 1 1 I 1 1 I 11 

TAGCAGGGAAACTGCTGATAGTCATGAAAACAACATGGATGGCCCCA 133 9 



I 1 

1290 CA' 



1151 TCTCTCTCATCAGACCAAAGAGTCGAC C C CAGGAAAGAGAGGC CT CGC C C 1200 

I I I I I I I I I I I 1 I I I I l 1 I I I I S I I I I I I I I! I I I I 1 I I I 1 I I II I 1 1 I 
I I i I I j M I M I M M I I 1 1 1 I I I I M I 1 I I 1 I I M I I I I I 1 1 It I lit 

1340 TCTCTCTCATCAGACCAAAGAGTCGACCCCAGGAAAGAGAGGCCTCTCCC 13 89 

« * * * * 

1201 AGCAATAGCTGCCTCGATTCTACTGACTCAGAAAGTAGCCATGATGACCG 1250 

1 1 I I 1 I 1 I ! 1 i I 1 I Mill I 1 I t 1 I I M 11 I M I M i I 1 I 11 t it I 
I 1 1 M I I I I I I M I I I I f I M I I I I ! J I I ) I I 1 ! 1 I II I I I t 1 I I 1 

1390 AGCAATAGCTGCCTGGATTCCACTGACTCAGAAAGCAGCCATGATGACCA 14 3 9 



1251 CCAGTCCTACCAAGGAAACCCTGCCTTAAATCCCAAGAGGAAACAAAGCC 13 0 0 

I U M 1 M I 11 SI I ! I I 1 I M I ! It I 1 1 II I 11 I I 1 t 1 M 1! 1 j I I I I 1 
! ! I ! I I I 11 1 I I M I I lillllMIIMNIIIIIIIIIIMIIIIIll 

144 0 CCAGTCCTACCAAGGACACCCTGCCTTAAATCCCAAGAGGAAACAAAGCC 14 8 9 



13 01 C AGCTTACATGAAGGAGGATGTC AAGGCTTTGGATGC TACCAAGGCC C C C 1350 

I [ I I I I I I I I I I M I I I 1 I M I M I I I I 1 t I 1 I I I I I i 1 I 1 I I 1 I I 
I I 11 It 1 I I i l I I I M it 1 t t I I I I 1 I M M 1 I i MMMIM! M 

14 90 CAGCTTACATGAAGGAGGATGTCAAAGCTTTGGATACTACC AAGGCTCCT 153 9 

***** 

1351 AAGGGCTCTCTGAAGGACATCTATAAGGTTTTCAATGGAGAAGGAGAACA 1400 

I I I I 1 I I I i ( ! I ! i M I I I I I I ! MM! I ) I 1 1 t I 1 1 I 1 t M 1 1 11 I 
| I I I I I I 1 I M I M I I I 1 II I 1 I i 1 It I I I I t 1 It I I 1 1 11 M II ( i 

1540 AAGGGCTCTCTGAAGGACATCTACAAGGTCTTCAATGGGGAAGGAGAACA 1589 

***** 

14 01 GATAAGGGCCTTCAAGTGTGAGCACTGCCGAGTCCTTTTTCTAGACCATG 1450 

111 I I I ! I 1 I I I I I 1 I M M ! I I I I I I I I !! I M I I 1 ! I! I 11 I ! I I 1 

Ml M II M I M M I 1 11 I M I M I I M I M It I I M I M I M I I I I I 

1590 GATTAGGGCCTTCAAGTGTGAGCACTGCCGAGTCCTTTTCCTAGACCATG 163 9 

***** 

1451 TCATGTACACCATTCACATGGGTTGCCATGGCTACCGGGACCCACTGGAA 150 0 

M I I I II I M I M 1 SI I M M 1 M M M t M i M M M M M M M M M 
M I M II I M M I It M M 1 1 I II 1 M i I 1 II t I II 1 1 I M t II I I I I M 

1640 TCATGTACACCATTCACATGGGTTGCCATGGCTACCGGGACCCACTGGAA 1689 

• » - . . 

1501 TGCAACATCTGTGGCTACAGAAGCCAGGACCGCTACGAATTTTCATCACA 155 0 

M i M M 1 I 1 M M M M I M 1 I I I M I 1 M I I II M I I I i M I M 
M 1 M I M M 11 M II I I M M M 11 1 I II I II M M M M I I M I 

16 90 TGTAACATCTGTGGCTACAGAAGCC AGGACCGTTATGAGTTTTCATCACA 173 9 
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1551 CATTGTTGGGGGGCAGCACACATTCCACTAGGCGTTTGCATTCCAAGG 1598 

! 1 l i I I I i Ml I M I M I 1 I ! I I I ! I M I I Ml 1 M M M I i 

M I M M I I M M M M M ! M M i M M i MI M M M M I 

1740 CATTGTTCGAGGGGAGCACACATTCCACTAGGCCTTTTCATTCCAAAG 1787 
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1 METEAIDGYITCDNELSPEREHSNMAIDLTSSTPNGQHASPSHMTSTDSV 50 

u t i li i M I it i I t M M M u I i i I ! I I I M U I i i l I i I I t i I M i i 

I M I I i i M I M f M I M 1 M j 1 1 1 I 1 1 I M II 11 1 11 I It t I M M M 1 

1 METEAIDGYITCDNELSPEREHSNMAIDLTSSTPNGQHASPSHMTSTDSV 50 

* • ♦ » ■ 

51 KLEMQSDEECDRKPLSREDEIRGHDEGSSLEEPLIESSEVADHREVQELQ 100 

1 1 U 1 f II i M I II I II 1 M I ! It M I II 1 1 M 1 11 M 1 11 I 1 II M M 1 
IIMMIMMIMIMMMIMIIIIIiMMIIIIIIIllMIIIII 

51 KLEMQSDEECDRKPLSREDEIRGHDEGSSLEEPLIESSEVADNREVQELQ 100 

* * ■ • * 

101 GEGGIRLPNGKLKCDVCGMVCIGPNVLMVHKRSHTGERPFHCNQCGASFT 150 

I ! i I 1 I ! M I I I ! M 1 I I I I I II I I I ! I I ! I ! I I I I I 1 !! I I I I I 1 I I I I 
IIIIIIMIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIMIIIINI 

101 GEGGIRLPNGKLKCDVCGMVCIGPNVLMVHKRSHTGERPFHCNQCGASFT 150 

* * • * * 

151 QKGNLLRHIKLHSGEKPFKCPFCSHACRRRDALTGYLRTHSVGKPHKCNY 200 

II I I I I I f I M 1 M f I i I M 1 M I M I M ) 1 I II M M M M I ! 1 I I II t 
I I I I I I I M I I I I I M I 1 I I M 1 I S ! I I I I I I I I I I I i I I I II I I I I I I I 

151 QKGNLLRHIKLHSGEKPFKCPFCSHACRRRDALTGYLRTHSVGKPHKCNY 200 

w * + * # 

201 CGRSYKQRSSLEEHKERCHNYLQNVSMEAAGQVMSHHVPPMEDCKEQEPI 250 

1 I I II I I I 11 I M l I I 11 I I I I I I f M I II i I i \ II M M II I U I I 1 I 1 
1 M I I I I I II I ! U I I I I I II I I I I I I I I I I I SI I I t I I I I I M 1 I 1! M 

201 CGRSYKQRSSLEEHKERCHNYLQNVSMEAAGQVMSHHVPPMEDCKEQEPI 250 

■ • * • * 

251 MDNNISLVPFERPAVIEKLTGNMGKRKSSTPQKFVGEKLMRFSYPDIHFD 3 00 

M M I 11 M M II 1 I I t I M M I ! 1 I ! I I I I 1 1 I I M I M M 1 I M M M 
I M I I 1 I I M I M I I 1 M 1 1 I I I I i I M 11 I I 1 I M 1 I It M I 1 I M I II 

251 MDNNISLVPFERPAVIEKLTGNMGKRKSSTPQKFVGEKLMRFSYPDIHFD 300 
3 01 MNLTYEKEAELMQSHMMDQAINKAITYLGAEALHPLMQHPPSTIAEVAPV 350 

I I I I I I I I I I I I I ! I I I i I I I M I I ! I I 1 M l I I M I II I I l I I I I i I M 
1 II I M I I M M 1 I i I 1 i I I I f I 11 ! I I I i M I 11 I It II I I 11 M I It I 

3 01 MNLTYEKEAELMQSHMMDQAINNAITYLGAEALHPLMQHPPSTIAEVAPV 350 

* * * # # 

351 ISSAYSQVYHPNRIERPISRETADSHENWMDGPISLIRPKSRPQEREASP 400 

I 1 I 1 I 1 I I 1 I I I I I f I I 11 l 1 II 11 I I 1 I I 11 ! M 11 M I M 11 M I M I 
M M U II M I I i I I I i I I i M 1 I 1 M 1 II 1 11 I I 1 I I I I I 1 I I I I I I I i 

351 ISSAYSQVYHPNRIERPISRETADSHENNMDGPISLIRPKSRPQEREASP 400 

■ » * • • 

401 SNSCLDSTDSESSHDDHQSYQGHPALNPKRKQSPAYMKEDVKALDTTKAP 450 

i I I 1 I 1 I I I M I I I I M I 1 ! II 1 M I I I I I I I I I I I I I I 1 I ! I ! M I I I I 

II 1 I M I M M 1 I f I M M M ) I I II 1! I 1 II M M I I I M I 11 I I II I 1 

401 SNSCLDSTDSESSHDDHQSYQGHPALNPKRKQSPAYMKEDVKALDTTKAP 450 

* • » • » 

451 KGSLKDIYKVFNGEGEQIRAFKCEHCRVLFLDHVMYTIHMGCHGYRDPLE 500 

j I I I I I I I 1 I ! I ! M 1 I I I I i I I 1 I I I I ! I I M I I I l I I I I I I i I I I I I I 
I I 1 I I I I 1 I M I 1 i 1 I I i ! ! 1 i M I M I i i II i I M I I 1 I M II I II I ) I 

4 51 KGSLKDIYKVFNGEGEQIRAFKCEHCRVLFLDHVMYTIHMGCHGYRDPLE 500 

* ■ 

501 CNICGYRSQDRYEFSSHIVRGEHTFH 526 

I I I II I I I I 1 M I I I I I I I I I I I I I I 

II I M I 11 I t II I I 1 1 1 1 I I i 1 11 It 

501 CNICGYRSQDRYEFSSHIVRGEHTFH 526 
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SEQUENCE LISTING 



<110> Georgopoulos, Katia 
Mogan, Bruce A. 
Kelly, Clair 

<120> THE HELIOS GENE 

<130> 10287/043001 

<150> US 60/076,325 
<151> 1998-02-27 

<160> 17 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1598 
<212> DNA 

<213> Mus musculus 





<220> 
<221> 
<222> 


CDS 
(1) • 


, • (1578) 
























!IJ <400> 

l ft atg gaa aca 

; "fi Met Glu Thr 

i 


1 

gac 
Asp 


get 
Ala 
5 


att 
He 


gat 
Asp 


ggc 

Gly 


tat 
Tyr 


at a 
He 
10 


aca 
Thr 


tgt gac 
Cys Asp 


aat 
Asn 


gag 

Glu 

15 


ctt 
Leu 


48 


^-s? tea 
Ser 


ccc 
Pro 


gaa 
Glu 


ggg 

Gly 
20 


gaa 
Glu 


cac 
His 


gec 
Ala 


aat 
Asn 


atg 
Met 
25 


gec 
Ala 


att 
He 


gac 
Asp 


etc 
Leu 


acc 
Thr 
30 


tea 
Ser 


age 
Ser 


96 


s zf: acg 
1 y Thr 

i: 


ccc 
Pro 


aat 
Asn 

35 


gga 
Gly 


cag 
Gin 


cac 
His 


gec 
Ala 


teg 
Ser 
40 


cca 
Pro 


agt 
Ser 


cac 

His 


atg 
Met 


aca 
Thr 
45 


age 
Ser 


aca 
Thr 


aat 
Asn 


144 


^ tct 

j.w = ser 


gta 
Val 
50 


aag 
Lys 


ctg 
Leu 


gaa 
Glu 


atg 
Met 


cag 
Gin 
55 


agt 
Ser 


gat 
Asp 


gaa 
Glu 


gag 
Glu 


tgt 
Cys 
60 


gac 
Asp 


agg 
Arg 


cag 
Gin 


ccc 
Pro 


192 


ctg 
Leu 
65 


age 
Ser 


cgt 
Arg 


gag 
Glu 


gat 
Asp 


gag 
Glu 
70 


ate 
He 


agg 
Arg 


ggc 
Gly 


cac gat 
His Asp 
75 


gag ggg 
Glu Gly 


age 
Ser 


age 
Ser 


eta 
Leu 
80 


240 


gaa 
Glu 


gaa 
Glu 


ccc 
Pro 


eta 
Leu 


att 
He 
85 


gag 
Glu 


age 
Ser 


age 
Ser 


gag 
Glu 


gtg gee 
Val Ala 
90 


gac aac 
Asp Asn 


agg 
Arg 


aaa 
Lys 
95 


gtc 

Val 


288 


cag gac 
Gin Asp 


ctt 
Leu 


caa 
Gin 
100 


ggc 
Gly 


gag 
Glu 


gga 
Gly 


gga 
Gly 


ate 
He 
105 


egg 
Arg 


Ctt 

Leu 


ccg 
Pro 


aat 
Asn 


ggt 
Gly 
110 


aaa 
Lys 


ctg 
Leu 


336 


aaa 
Lys 


tgt 
Cys 


gac 
Asp 
115 


gtc 
Val 


tgt 
Cys 


ggc atg 
Gly Met 


gtt 
Val 
120 


tgc 
Cys 


att ggg 
He Gly 


ccc 
Pro 


aat 
Asn 
125 


gtg 
val 


ctt 
Leu 


atg 
Met 


384 



-1- 



gta cat aaa agg agt cac act ggt gag egg ccc ttc cac tgt aac cag 432 
Val His Lys Arg Ser His Thr Gly Glu Arg Pro Phe His Cys Asn Gin 
130 135 140 

tgc gga get tct ttt acc cag aag ggc aac ctt ctg aga cac ata aag 480 
Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu Leu Arg His lie Lys 
145 150 155 160 

tta cac tct gga gag aag ccc ttc aaa tgt cct ttc tgt age tat get 528 
Leu His Ser Gly Glu Lys Pro Phe Lys Cys Pro Phe Cys Ser Tyr Ala 

165 170 175 

tgt aga aga agg gac get etc aca gga cac etc agg acc cat tct gtg 576 
Cys Arg Arg Arg Asp Ala Leu Thr Gly His Leu Arg Thr His Ser Val 

180 185 190 

ggt aaa cct cac aag tgt aac tac tgt ggc cga age tac aag cag cgc 624 
Gly Lys Pro His Lys Cys Asn Tyr Cys Gly Arg Ser Tyr Lys Gin Arg 

195 200 205 

acg tea ctg gag gaa cac aag gaa cgc tgt cac aac tat etc cag aat 672 
Thr Ser Leu Glu Glu His Lys Glu Arg Cys His Asn Tyr Leu Gin Asn 
210 215 220 

gtc age atg gag get gec ggg cag gtc atg agt cac cat gta ccg cct 720 
Val Ser Met Glu Ala Ala Gly Gin Val Met Ser His His Val Pro Pro 
225 230 235 240 

atg gaa gat tgt aag gaa caa gag cct ate atg gac aac aat att tct 768 
Met Glu Asp Cys Lys Glu Gin Glu Pro lie Met Asp Asn Asn lie Ser 

245 250 255 

ctg gtg cct ttt gag aga cct get gtc ata gag aag etc acg gca aat 816 
Leu Val Pro Phe Glu Arg Pro Ala Val lie Glu Lys Leu Thr Ala Asn 

260 265 270 

atg gga aag cgc aaa age tec act cct cag aag ttt gtg ggg gaa aag 864 
Met Gly Lys Arg Lys Ser Ser Thr Pro Gin Lys Phe Val Gly Glu Lys 
275 280 285 

ctt atg cga ttc age tac cca gat att cat ttt gat atg aac tta aca 912 
Leu Met Arg Phe Ser Tyr Pro Asp lie His Phe Asp Met Asn Leu Thr 
290 295 300 

tat gag aag gag get gag ctg atg cag tct cat atg atg gac caa gee 960 
Tyr Glu Lys Glu Ala Glu Leu Met Gin Ser His Met Met Asp Gin Ala 
305 310 315 320 

ate aac aat gca ate acc tac ctt gga get gag gee ctt cac cct ctg 1008 
lie Asn Asn Ala He Thr Tyr Leu Gly Ala Glu Ala Leu His Pro Leu 

325 330 335 

atg cag cat gca cca age aca ate get gag gtg gec cca gtt ata age 1056 
Met Gin His Ala Pro Ser Thr He Ala Glu Val Ala Pro Val He Ser 

340 345 350 

tea get tat tct cag gtc tat cat cca aac agg ata gaa aga ccc att 1104 
Ser Ala Tyr Ser Gin Val Tyr His Pro Asn Arg He Glu Arg Pro He 
355 360 365 



-2- 



age agg gaa aca tct gat agt cac gaa aac aac atg gat ggc ccc ate 1152 
Ser Arg Glu Thr Ser Asp Ser His Glu Asn Asn Met Asp Gly Pro lie 
370 375 380 

tct etc ate aga cca aag agt cga ccc cag gaa aga gag gee teg ccc 1200 
Ser Leu lie Arg Pro Lys Ser Arg Pro Gin Glu Arg Glu Ala Ser Pro 
385 390 395 400 

age aat age tgc etc gat tct act gac tea gaa agt age cat gat gac 1248 
Ser Asn Ser Cys Leu Asp Ser Thr Asp Ser Glu Ser Ser His Asp Asp 

405 410 415 

cgc cag tec tac caa gga aac cct gee tta aat ccc aag agg aaa caa 1296 
Arg Gin Ser Tyr Gin Gly Asn Pro Ala Leu Asn Pro Lys Arg Lys Gin 

420 425 430 

age cca get tac atg aag gag gat gtc aag get ttg gat get acc aag 1344 
Ser Pro Ala Tyr Met Lys Glu Asp Val Lys Ala Leu Asp Ala Thr Lys 
435 440 445 

gee ccc aag ggc tct ctg aag gac ate tat aag gtt ttc aat gga gaa 1392 
Ala Pro Lys Gly Ser Leu Lys Asp lie Tyr Lys Val Phe Asn Gly Glu 
450 455 460 

gga gaa cag ata agg gec ttc aag tgt gag cac tgc cga gtc ctt ttt 1440 
Gly Glu Gin He Arg Ala Phe Lys Cys Glu His Cys Arg Val Leu Phe 
465 470 475 480 

eta gac cat gtc atg tac acc att cac atg ggt tgc cat ggc tac egg 1488 
Leu Asp His Val Met Tyr Thr He His Met Gly Cys His Gly Tyr Arg 

485 490 495 

gac cca ctg gaa tgc aac ate tgt ggc tac aga age cag gac cgc tac 1536 
Asp Pro Leu Glu Cys Asn He Cys Gly Tyr Arg Ser Gin Asp Arg Tyr 

500 505 510 

gaa ttt tea tea cac att gtt ggg ggg cag cac aca ttc cac 1578 
Glu Phe Ser Ser His He Val Gly Gly Gin His Thr Phe His 
515 520 525 

taggcgtttg cattccaagg 1598 

<210> 2 
<211> 526 
<212> PRT 

<213> Mus musculus 





<400> 


2 


Met 


Glu 


Thr 


Asp 


1 








Ser 


Pro 


Glu 


Gly 








20 


Thr 


Pro 


Asn 


Gly 






35 




Ser 


Val 


Lys 


Leu 




50 






Leu 


Ser 


Arg 


Glu 


65 








Glu 


Glu 


Pro 


Leu 


Gin 


Asp 


Leu 


Gin 








100 



Ala 


He 


Asp 


Gly 


5 








Glu 


His 


Ala 


Asn 


Gin 


His 


Ala 


Ser 








40 


Glu 


Met 


Gin 


Ser 






55 




Asp 


Glu 


He 


Arg 




70 






He 


Glu 


Ser 


Ser 


85 








Gly 


Glu 


Gly 


Gly 



Tyr 


He 


Thr 


Cys 




10 






Met 


Ala 


He 


Asp 


25 








Pro 


Ser 


His 


Met 


Asp 


Glu 


Glu 


Cys 








60 


Gly 


His 


Asp 


Glu 






75 




Glu 


Val 


Ala 


Asp 




90 






He 


Arg 


Leu 


Pro 



105 



Asp Asn Glu Leu 
15 

Leu Thr Ser Ser 
30 

Thr Ser Thr Asn 
45 

Asp Arg Gin Pro 

Gly Ser Ser Leu 

80 

Asn Arg Lys Val 
95 

Asn Gly Lys Leu 
110 



-3- 



Lys 


Cys 


Asp 


Val 


Cys 


Gly Met 






115 








Val 


His 


Lys 


Arg 


Ser 


His Thr 




130 






135 


Cys 


Gly 


Ala 


Ser 


Phe 


Thr Gin 


145 










150 


Leu 


His 


Ser 


Gly 


Glu 


Lys Pro 










165 




Cys 


Arg 


Arg 


Arg 


Asp 


Ala Leu 








180 






Gly 


Lys 


Pro 


His 


Lys 


Cys Asn 






195 








Thr 


Ser 


Leu 


Glu 


Glu 


His Lys 




210 








215 


Val 


Ser 


Met 


Glu 


Ala 


Ala Gly 


225 










230 


Met 


Glu 


Asp 


Cys 


Lys 


Glu Gin 










245 




Leu 


Val 


Pro 


Phe 


Glu 


Arg Pro 








260 






Met 


Gly 


Lys 


Arg 


Lys 


Ser Ser 






275 








Leu 


Met 


Arg 


Phe 


Ser 


Tyr Pro 




290 








295 


^ Tyr 


Glu 


Lys 


Glu 


Ala 


Glu Leu 


™ 305 








310 


^ He 


Asn 


Asn 


Ala 


He 


Thr Tyr 










325 




If! Met 


Gin 


His 


Ala 


Pro 


Ser Thr 








340 






Ser 


Ala 


Tyr 


Ser 


Gin 


Val Tyr 






355 








\M Ser 


Arg 


Glu 


Thr 


Ser 


Asp Ser 




370 








375 


Ser 


Leu 


He 


Arg 


Pro 


Lys Ser 


U 385 










390 


zJ Ser 


Asn 


Ser 


Cys 


Leu 


Asp Ser 










405 




|| Arg 


Gin 


Ser 


Tyr 


Gin 


Gly Asn 


3 BBS. 






420 






1! Ser 


Pro 


Ala 


Tyr 


Met 


Lys Glu 


* c 




435 






0 Ala 


Pro 


Lys 


Gly 


Ser 


Leu Lys 




450 








455 


Gly 


Glu 


Gin 


He 


Arg 


Ala Phe 


465 










470 


Leu 


Asp 


His 


Val 


Met 


Tyr Thr 










485 




Asp 


Pro 


Leu 


Glu 


Cys 


Asn He 








500 






Glu 


Phe 


Ser 


Ser 


His 


He Val 



515 



<210> 3 

<211> 1520 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) • . • (1500) 



Val Cys 


He 


Gly Pro Asn 


Val 


Leu 


Met 


120 




125 








Gly Glu 


Arg 


Pro Phe His 


Cys 


Asn 


Gin 






140 








Lys Gly 


Asn 


Leu Leu Arg 


His 


He 


Lys 






155 






160 


Phe Lys 


Cys 


Pro Phe Cys 


Ser 


Tyr 


Ala 




170 






175 




Thr Gly 


His 


Leu Arg Thr 


His 


Ser 


Val 


185 






190 






Tyr Cys 


Gly 


Arg Ser Tyr 


Lys 


Gin 


Arg 


200 




205 








Glu Arg 


Cys 


His Asn Tyr 


Leu 


Gin 


Asn 






220 








Gin Val 


Met 


Ser His His 


Val 


Pro 


Pro 






235 






240 


Glu Pro 


He 


Met Asp Asn 


Asn 


He 


Ser 




250 






255 




Ala Val 


He 


Glu Lys Leu 


Thr 


Ala 


Asn 


265 




270 






Thr Pro 


Gin 


Lys Phe Val 


Gly 


Glu 


Lys 


280 




285 








Asp He 


His 


Phe Asp Met 


Asn 


Leu 


Thr 






300 








Met Gin 


Ser 


His Met Met 


Asp 


Gin 


Ala 






315 




320 


Leu Gly 


Ala 


Glu Ala Leu 


His 


Pro 


Leu 


330 






335 




He Ala 


Glu 


Val Ala Pro 


Val 


He 


Ser 


345 






350 






His Pro 


Asn 


Arg He Glu 


Arg 


Pro 


He 


360 




365 






His Glu 


Asn 


Asn Met Asp 


Gly 


Pro 


He 






380 








Arg Pro 


Gin 


Glu Arg Glu 


Ala 


Ser 


Pro 






395 






400 


Thr Asp 


Ser 


Glu Ser Ser 


His 


Asp 


Asp 




410 






415 




Pro Ala 


Leu 


Asn Pro Lys 


Arg 


Lys 


Gin 


425 






430 






Asp Val 


Lys 


Ala Leu Asp 


Ala 


Thr 


Lys 


440 




445 








Asp He 


Tyr 


Lys Val Phe 


Asn 


Gly 


Glu 






460 








Lys Cys 


Glu 


His Cys Arg 


Val 


Leu 


Phe 






475 






480 


He His 


Met 


Gly Cys His 


Gly 


Tyr 


Arg 




490 






495 




Cys Gly 


Tyr 


Arg Ser Gin 


Asp 


Arg 


Tyr 


505 






510 






Gly Gly 


Gin 


His Thr Phe 


His 






520 




525 
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<400> 3 

atg gaa aca gac get att gat ggc tat ata aca tgt gac aat gag ctt 48 
Met Glu Thr Asp Ala lie Asp Gly Tyr lie Thr Cys Asp Asn Glu Leu 
15 10 15 

tea ccc gaa ggg gaa cac gec aat atg gec att gac etc acc tea age 96 
Ser Pro Glu Gly Glu His Ala Asn Met Ala lie Asp Leu Thr Ser Ser 

20 25 30 

acg ccc aat gga cag cac gec teg cca agt cac atg aca age aca aat 144 
Thr Pro Asn Gly Gin His Ala Ser Pro Ser His Met Thr Ser Thr Asn 
35 40 45 

tct gta aag ctg gaa atg cag agt gat gaa gag tgt gac agg cag ccc 192 
Ser Val Lys Leu Glu Met Gin Ser Asp Glu Glu Cys Asp Arg Gin Pro 
50 55 60 

ctg age cgt gag gat gag ate agg ggc cac gat gag ggg age age eta 240 
Leu Ser Arg Glu Asp Glu lie Arg Gly His Asp Glu Gly Ser Ser Leu 
65 70 75 80 

gaa gaa ccc eta att gag age age gag gtg gec gac aac agg aaa gtc 288 
Glu Glu Pro Leu lie Glu Ser Ser Glu Val Ala Asp Asn Arg Lys Val 

85 90 95 

cag gac ctt caa ggc gag gga gga ate egg ctt ccg aat ggt gag egg 336 
Gin Asp Leu Gin Gly Glu Gly Gly lie Arg Leu Pro Asn Gly Glu Arg 

100 105 110 

ccc ttc cac tgt aac cag tgc gga get tct ttt acc cag aag ggc aac 384 
Pro Phe His Cys Asn Gin Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn 
115 120 125 

ctt ctg aga cac ata aag tta cac tct gga gag aag ccc ttc aaa tgt 432 
Leu Leu Arg His lie Lys Leu His Ser Gly Glu Lys Pro Phe Lys Cys 
130 135 140 

cct ttc tgt age tat get tgt aga aga agg gac get etc aca gga cac 480 
Pro Phe Cys Ser Tyr Ala Cys Arg Arg Arg Asp Ala Leu Thr Gly His 
145 150 155 160 

etc agg acc cat tct gtg ggt aaa cct cac aag tgt aac tac tgt ggc 528 
Leu Arg Thr His Ser Val Gly Lys Pro His Lys Cys Asn Tyr Cys Gly 

165 170 175 

cga age tac aag cag cgc acg tea ctg gag gaa cac aag gaa cgc tgt 576 
Arg Ser Tyr Lys Gin Arg Thr Ser Leu Glu Glu His Lys Glu Arg Cys 

180 185 190 

cac aac tat etc cag aat gtc age atg gag get gec ggg cag gtc atg 624 
His Asn Tyr Leu Gin Asn Val Ser Met Glu Ala Ala Gly Gin Val Met 
195 200 205 

agt cac cat gta ccg cct atg gaa gat tgt aag gaa caa gag cct ate 672 
Ser His His Val Pro Pro Met Glu Asp Cys Lys Glu Gin Glu Pro lie 
210 215 220 

atg gac aac aat att tct ctg gtg cct ttt gag aga cct get gtc ata 720 
Met Asp Asn Asn lie Ser Leu Val Pro Phe Glu Arg Pro Ala Val lie 
225 230 235 240 
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gag aag etc acg gca aat atg gga aag cgc aaa age tec act cct cag 768 
Glu Lys Leu Thr Ala Asn Met Gly Lys Arg Lys Ser Ser Thr Pro Gin 

245 250 255 

aag ttt gtg ggg gaa aag ctt atg cga ttc age tac cca gat att cat 816 
Lys Phe Val Gly Glu Lys Leu Met Arg Phe Ser Tyr Pro Asp lie His 

260 265 270 

ttt gat atg aac tta aca tat gag aag gag get gag ctg atg cag tct 864 
Phe Asp Met Asn Leu Thr Tyr Glu Lys Glu Ala Glu Leu Met Gin Ser 
275 280 285 

cat atg atg gac caa gee ate aac aat gca ate ace tac ctt gga get 912 
His Met Met Asp Gin Ala lie Asn Asn Ala lie Thr Tyr Leu Gly Ala 
290 295 300 

gag gec ctt cac cct ctg atg cag cat gca cca age aca ate get gag 960 
Glu Ala Leu His Pro Leu Met Gin His Ala Pro Ser Thr He Ala Glu 
305 310 315 320 

gtg gee cca gtt ata age tea get tat tct cag gtc tat cat cca aac 1008 
Val Ala Pro Val He Ser Ser Ala Tyr Ser Gin Val Tyr His Pro Asn 

325 330 335 

agg ata gaa aga ccc att age agg gaa aca tct gat agt cac gaa aac 1056 
Arg He Glu Arg Pro He Ser Arg Glu Thr Ser Asp Ser His Glu Asn 

340 345 350 

aac atg gat ggc ccc ate tct etc ate aga cca aag agt cga ccc cag 1104 
Asn Met Asp Gly Pro He Ser Leu He Arg Pro Lys Ser Arg Pro Gin 
355 360 365 

gaa aga gag gec teg ccc age aat age tgc etc gat tct act gac tea 1152 
Glu Arg Glu Ala Ser Pro Ser Asn Ser Cys Leu Asp Ser Thr Asp Ser 
370 375 380 

gaa agt age cat gat gae cgc cag tec tac caa gga aac cct gec tta 1200 
Glu Ser Ser His Asp Asp Arg Gin Ser Tyr Gin Gly Asn Pro Ala Leu 
385 390 395 400 

aat ccc aag agg aaa caa age cca get tac atg aag gag gat gtc aag 1248 
Asn Pro Lys Arg Lys Gin Ser Pro Ala Tyr Met Lys Glu Asp Val Lys 

405 410 415 

get ttg gat get acc aag gec ccc aag ggc tct ctg aag gac ate tat 1296 
Ala Leu Asp Ala Thr Lys Ala Pro Lys Gly Ser Leu Lys Asp He Tyr 

420 425 430 

aag gtt ttc aat gga gaa gga gaa cag ata agg gec ttc aag tgt gag 1344 
Lys Val Phe Asn Gly Glu Gly Glu Gin He Arg Ala Phe Lys Cys Glu 
435 440 445 

cac tgc cga gtc ctt ttt eta gac cat gtc atg tac acc att cac atg 1392 
His Cys Arg Val Leu Phe Leu Asp His Val Met Tyr Thr He His Met 
450 455 460 

ggt tgc cat ggc tac egg gac cca ctg gaa tgc aac ate tgt ggc tac 1440 
Gly Cys His Gly Tyr Arg Asp Pro Leu Glu Cys Asn He Cys Gly Tyr 
465 470 475 480 
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aga age cag gac cgc tac gaa ttt tea tea cac att gtt ggg ggg cag 1488 
Arg Ser Gin Asp Arg Tyr Glu Phe Ser Ser His lie Val Gly Gly Gin 

485 490 495 

cac aca ttc cac taggcgtttg cattccaagg 1520 
His Thr Phe His 

500 



<210> 4 

<211> 500 

<212> PRT 

<213> Mus musculus 



<400> 4 



Met 


Glu 


Thr 


Asp 


Ala 


He 


Asp 


Gly 


Tyr 


He Thr Cys 


Asp Asn 


Glu Leu 


1 








5 










10 




15 


Ser 


Pro 


Glu 


Gly 


Glu 


His 


Ala 


Asn 


Met 


Ala He Asp Leu Thr 


Ser Ser 








20 










25 




30 




Thr 


Pro 


Asn 


Gly 


Gin 


His 


Ala 


Ser 


Pro 


Ser His Met 


Thr Ser 


Thr Asn 






35 










40 






45 




Ser 


Val 


Lys 

J. 


Leu 


Glu 


Met 


Gin 


Ser 


Asp 


Glu Glu Cys 


Asp Arg 


Gin Pro 




50 










55 






60 






Q. Leu 


Ser 


Arg 


Glu 


Asp 


Glu 


He 


Arg 


Gly 


His Asp Glu Gly Ser Ser Leu 


. #s 65 










70 








75 




80 


Glu 


Glu 


Pro 


Leu 


He 


Glu 


Ser 


Ser 


Glu 


Val Ala Asp Asn Arg Lys Val 










85 










90 




95 


If] Gin 


Asp 


Leu 


Gin 


Gly 


Glu 


Gly 


Gly 


He 


Arg Leu Pro 


Asn Gly Glu Arg 








100 










105 




110 




= r e Pro 


Phe 


His 


Cys 


Asn 


Gin 


Cys 


Gly 


Ala 


Ser Phe Thr Gin Lys Gly Asn 






115 










120 






125 




Leu 


Leu 


Arg 


His 


He 


Lys 


Leu 


His 


Ser 


Gly Glu Lys 


Pro Phe 


Lys Cys 




130 










135 






140 






Pro 


Phe 


Cys 


Ser 


Tyr 


Ala 


Cys 


Arg 


Arg 


Arg Asp Ala 


Leu Thr 


Gly His 


U 145 










150 








155 




160 


J!l Leu 


Arg 


Thr 


His 


Ser 


Val 


Gly 


Lys 


Pro 


His Lys Cys Asn Tyr Cys Gly 










165 










170 




175 


y Arg 


Ser 


Tyr 


Lys 


Gin 


Arg 


Thr 


Ser 


Leu 


Glu Glu His 


Lys Glu 


Arg Cys 








180 










185 




190 




^ His 


Asn 


Tyr 


Leu 


Gin 


Asn 


Val 


Ser 


Met 


Glu Ala Ala Gly Gin Val Met 






195 










200 






205 




y Ser 


His 


His 


Val 


Pro 


Pro 


Met 


Glu 


Asp 


Cys Lys Glu 


Gin Glu 


Pro He 




210 










215 






220 






Met 


Asp 


Asn 


Asn 


He 


Ser 


Leu 


Val 


Pro 


Phe Glu Arg 


Pro Ala 


Val He 


225 








230 








235 




240 


Glu 


Lys 


Leu 


Thr 


Ala 


Asn 


Met 


Gly 


Lys 


Arg Lys Ser 


Ser Thr 


Pro Gin 










245 










250 




255 


Lys 


Phe 


Val 


Gly 


Glu 


Lys 


Leu 


Met 


Arg 


Phe Ser Tyr 


Pro Asp 


He His 








260 










265 




270 




Phe 


Asp 


Met 


Asn 


Leu 


Thr 


Tyr 


Glu 


Lys 


Glu Ala Glu 


Leu Met 


Gin Ser 




275 








280 




285 




His 


Met 


Met 


Asp 


Gin 


Ala 


He 


Asn 


Asn 


Ala He Thr 


Tyr Leu 


Gly Ala 




290 










295 






300 




Glu 


Ala 


Leu 


His 


Pro 


Leu 


Met 


Gin 


His 


Ala Pro Ser 


Thr He 


Ala Glu 


305 










310 








315 




320 


Val 


Ala 


Pro 


Val 


He 


Ser 


Ser 


Ala 


Tyr 


Ser Gin Val Tyr His Pro Asn 










325 








330 




335 


Arg 


He 


Glu 


Arg 


Pro 


He 


Ser 


Arg 


Glu 


Thr Ser Asp 


Ser His 


Glu Asn 








340 










345 




350 




Asn 


Met 


Asp 


Gly 


Pro 


lie 


Ser 


Leu 


He 


Arg Pro Lys 


Ser Arg Pro Gin 






355 










360 






365 
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•seen 



Glu 


Arg 


Glu 


Ala 


Ser 


Pro 


Ser 


Asn 


Ser 


Cys 


Leu 


Asp 


Ser 


Thr 


Asp 


Ser 




370 










375 










380 










Glu 


Ser 


Ser 


His 


Asp 


Asp 


Arg 


Gin 


Ser 


Tyr 


Gin 


Gly 


Asn 


Pro 


Ala 


Leu 


385 










390 










395 










400 


Asn 


Pro 


Lys 


Arg 


Lys 


Gin 


Ser 


Pro 


Ala Tyr 


Met 


Lys 


Glu 


Asp 


Val 


Lys 










405 










410 










415 




Ala 


Leu 


Asp 


Ala 


Thr 


Lys 


Ala 


Pro 


Lys 


Gly 


Ser 


Leu 


Lys 


Asp 


He 


Tyr 








420 










425 










430 






Lys 


Val 


Phe 


Asn 


Gly 


Glu 


Gly Glu 


Gin 


He 


Arg 


Ala 


Phe 


Lys 


Cys 


Glu 




435 










440 










445 








His 


Cys 


Arg 


Val 


Leu 


Phe 


Leu 


Asp 


His 


Val 


Met 


Tyr 


Thr 


He 


His 


Met 












455 




















Gly 


Cys 


His 


Gly 


Tyr 


Arg 


Asp 


Pro 


Leu 


Glu 


Cys 


Asn 


He 


Cys 


Gly 


Tyr 


465 










470 










475 










480 


Arg 


Ser 


Gin 


Asp 


Arg 


Tyr 


Glu 


Phe 


Ser 


Ser 


His 


He 


Val 


Gly 


Gly 


Gin 










485 










490 










495 




His 


Thr 


Phe 


His 
































500 
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5 




























<211> 


1927 


























<212> 


DNA 




























<213> 


Homo sapiens 





















<220> 
<221> CDS 

<222> (190) ... (1767) 
<400> 5 

gcccgggcag gtcgcattgc tatagcactg actgacctct ctctctctct tttttttcct 60 
ctttcctgaa acccgacatt gtcacctcct ctttgagggt tagaagaagc tgagatctcc 120 
cgacagagct ggaaatggtg atgaatcttt tttaatcaaa ggacaatttc ttttcattgc 180 
actttgact atg gaa aca gag get att gat ggc tat ata acg tgt gac aat 231 
Met Glu Thr Glu Ala He Asp Gly Tyr He Thr Cys Asp Asn 
15 10 

gag ctt tea ccc gaa agg gag cac tec aat atg gca att gac etc ace 279 
Glu Leu Ser Pro Glu Arg Glu His Ser Asn Met Ala He Asp Leu Thr 
15 20 25 30 

tea age aca ccc aat gga cag cat gec tea cca agt cac atg aca age 327 
Ser Ser Thr Pro Asn Gly Gin His Ala Ser Pro Ser His Met Thr Ser 

35 40 45 

aca gat tea gta aag eta gaa atg cag agt gat gaa gag tgt gac agg 375 
Thr Asp Ser Val Lys Leu Glu Met Gin Ser Asp Glu Glu Cys Asp Arg 

50 55 60 

aaa ccc ctg age cgt gaa gat gag ate agg ggc cat gat gag ggt age 423 
Lys Pro Leu Ser Arg Glu Asp Glu He Arg Gly His Asp Glu Gly Ser 
65 70 75 

age eta gaa gaa ccc eta att gag age age gag gtg get gac aac agg 471 
Ser Leu Glu Glu Pro Leu He Glu Ser Ser Glu Val Ala Asp Asn Arg 
80 85 90 

gaa gtc cag gag ctt caa ggc gag gga gga ate egg ctt ccg aat ggt 519 
Glu Val Gin Glu Leu Gin Gly Glu Gly Gly He Arg Leu Pro Asn Gly 
95 100 105 110 



8 



aaa ctg aaa tgt gac gtc tgt ggc atg gtt tgc att ggg ccc aat gtg 567 
Lys Leu Lys Cys Asp Val Cys Gly Met Val Cys lie Gly Pro Asn Val 

115 120 125 

ctt atg gta cat aaa agg agt cac act ggt gaa cgc ccc ttc cac tgt 615 
Leu Met Val His Lys Arg Ser His Thr Gly Glu Arg Pro Phe His Cys 

130 135 140 

aac cag tgt gga get tct ttt act cag aag ggc aac ctt ctg aga cac 663 
Asn Gin Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu Leu Arg His 
145 150 155 

ata aag tta cac tct gga gag aag ccg ttc aaa tgt cct ttc tgt agt 711 
lie Lys Leu His Ser Gly Glu Lys Pro Phe Lys Cys Pro Phe Cys Ser 
160 165 170 

cac gec tgt aga aga agg gac gec etc aca gga tac etc agg acc cat 759 
His Ala Cys Arg Arg Arg Asp Ala Leu Thr Gly Tyr Leu Arg Thr His 
175 180 185 190 

tct gtg ggt aaa cct cac aag tgc aac tac tgt gga cga age tac aag 807 
Ser Val Gly Lys Pro His Lys Cys Asn Tyr Cys Gly Arg Ser Tyr Lys 

195 200 205 

cag cgc agt tea ctg gag gag cac aag gaa cgc tgc cac aac tat etc 855 
Gin Arg Ser Ser Leu Glu Glu His Lys Glu Arg Cys His Asn Tyr Leu 

210 215 220 

cag aat gtc age atg gag get get ggg cag gtc atg agt cac cat gta 903 
Gin Asn Val Ser Met Glu Ala Ala Gly Gin Val Met Ser His His Val 
225 230 235 

cct cct atg gaa gat tgt aag gaa caa gag cct att atg gac aac aat 951 
Pro Pro Met Glu Asp Cys Lys Glu Gin Glu Pro lie Met Asp Asn Asn 
240 245 250 

att tct ctg gtg cct ttt gag aga cct get gtc ata gag aag etc acg 999 
lie Ser Leu Val Pro Phe Glu Arg Pro Ala Val lie Glu Lys Leu Thr 
255 260 265 270 

ggg aat atg gga aaa cgt aaa age tec act cca caa aag ttt gtg ggg 1047 
Gly Asn Met Gly Lys Arg Lys Ser Ser Thr Pro Gin Lys Phe Val Gly 

275 280 285 

gaa aag etc atg cga ttc age tac cca gat att cac ttt gat atg aac 1095 
Glu Lys Leu Met Arg Phe Ser Tyr Pro Asp lie His Phe Asp Met Asn 

290 295 300 

tta aca tat gag aag gag get gag ctg atg cag tct cat atg atg gac 1143 
Leu Thr Tyr Glu Lys Glu Ala Glu Leu Met Gin Ser His Met Met Asp 
305 310 315 

caa gec ate aac aat gca ate acc tac ctt gga get gag gec ctt cac 1191 
Gin Ala lie Asn Asn Ala lie Thr Tyr Leu Gly Ala Glu Ala Leu His 
320 325 330 

cct ctg atg cag cac ccg cca age aca ate get gaa gtg gec cca gtt 1239 
Pro Leu Met Gin His Pro Pro Ser Thr lie Ala Glu Val Ala Pro Val 
335 340 345 350 
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ata age tea get tat tct cag gtc tat cat cca aat agg ata gaa aga 1287 
lie Ser Ser Ala Tyr Ser Gin Val Tyr His Pro Asn Arg lie Glu Arg 

355 360 365 

ccc att age agg gaa act get gat agt cat gaa aac aac atg gat ggc 1335 
Pro lie Ser Arg Glu Thr Ala Asp Ser His Glu Asn Asn Met Asp Gly 

370 375 380 

ccc ate tct etc ate aga cca aag agt cga ccc cag gaa aga gag gee 1383 
Pro lie Ser Leu lie Arg Pro Lys Ser Arg Pro Gin Glu Arg Glu Ala 
385 390 395 

tct ccc age aat age tgc ctg gat tec act gac tea gaa ago age cat 1431 
Ser Pro Ser Asn Ser Cys Leu Asp Ser Thr Asp Ser Glu Ser Ser His 
400 405 410 

gat gac cac cag tec tac caa gga cac cct gec tta aat ccc aag agg 1479 
Asp Asp His Gin Ser Tyr Gin Gly His Pro Ala Leu Asn Pro Lys Arg 
415 420 425 430 

aaa caa age cca get tac atg aag gag gat gtc aaa get ttg gat act 1527 
Lys Gin Ser Pro Ala Tyr Met Lys Glu Asp Val Lys Ala Leu Asp Thr 

435 440 445 

ace aag get cct aag ggc tct ctg aag gac ate tac aag gtc ttc aat 1575 
Thr Lys Ala Pro Lys Gly Ser Leu Lys Asp lie Tyr Lys Val Phe Asn 

450 455 460 

ggg gaa gga gaa cag att agg gee ttc aag tgt gag cac tgc cga gtc 1623 
Gly Glu Gly Glu Gin lie Arg Ala Phe Lys Cys Glu His Cys Arg Val 
465 470 475 

ctt ttc eta gac cat gtc atg tac ace att cac atg ggt tgc cat ggc 1671 
Leu Phe Leu Asp His Val Met Tyr Thr lie His Met Gly Cys His Gly 
480 485 490 

tac egg gac cca ctg gaa tgt aac ate tgt ggc tac aga age cag gac 1719 
Tyr Arg Asp Pro Leu Glu Cys Asn lie Cys Gly Tyr Arg Ser Gin Asp 
495 500 505 510 

cgt tat gag ttt tea tea cac att gtt cga ggg gag cac aca ttc cac 1767 
Arg Tyr Glu Phe Ser Ser His lie Val Arg Gly Glu His Thr Phe His 

515 520 525 

taggectttt cattccaaag gggaccctat gaagtaaaga ctgcacatga agaaatactg 1827 
cacttacaat cccacctttc ctcaaatgtt gtacctttta tttttttaat ataatactgg 1887 
tgataatctt attttgtgga gcagtgtcat ttgetetget 1927 





<210> 


6 














<211> 


526 














<212> 


PRT 














<213> 


Homo sapiens 














<400> 


6 












Met 


Glu Thr 


Glu Ala He Asp 


Gly Tyr 


He 


Thr 


Cys 


Asp Asn Glu Leu 


1 




5 




10 






15 


Ser 


Pro Glu 


Arg Glu His Ser 


Asn Met 


Ala 


He 


Asp 


Leu Thr Ser Ser 






20 


25 






30 


Thr 


Pro Asn 


Gly Gin His Ala 


Ser Pro 


Ser 


His 


Met 


Thr Ser Thr Asp 




35 




40 








45 


Ser 


Val Lys 


Leu Glu Met Gin 


Ser Asp 


Glu 


Glu 


Cys 


Asp Arg Lys Pro 




50 


55 








60 
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Leu Ser Arg Glu Asp Glu lie Arg Gly His Asp Glu Gly Ser Ser Leu 
65 70 75 80 

Glu Glu Pro Leu He Glu Ser Ser Glu Val Ala Asp Asn Arg Glu Val 

85 90 95 

Gin Glu Leu Gin Gly Glu Gly Gly He Arg Leu Pro Asn Gly Lys Leu 

100 105 HO 

Lys Cys Asp Val Cys Gly Met Val Cys He Gly Pro Asn Val Leu Met 

115 120 125 

Val His Lys Arg Ser His Thr Gly Glu Arg Pro Phe His Cys Asn Gin 

130 135 140 

Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu Leu Arg His He Lys 
145 150 155 160 

Leu His Ser Gly Glu Lys Pro Phe Lys Cys Pro Phe Cys Ser His Ala 

165 170 175 

Cys Arg Arg Arg Asp Ala Leu Thr Gly Tyr Leu Arg Thr His Ser Val 

180 185 190 

Gly Lys Pro His Lys Cys Asn Tyr Cys Gly Arg Ser Tyr Lys Gin Arg 

195 200 205 

Ser Ser Leu Glu Glu His Lys Glu Arg Cys His Asn Tyr Leu Gin Asn 

210 215 220 

Val Ser Met Glu Ala Ala Gly Gin Val Met Ser His His Val Pro Pro 
225 230 235 240 

Met Glu Asp Cys Lys Glu Gin Glu Pro He Met Asp Asn Asn He Ser 

245 250 255 

Leu Val Pro Phe Glu Arg Pro Ala Val He Glu Lys Leu Thr Gly Asn 

260 265 270 

Met Gly Lys Arg Lys Ser Ser Thr Pro Gin Lys Phe Val Gly Glu Lys 

275 280 285 

Leu Met Arg Phe Ser Tyr Pro Asp He His Phe Asp Met Asn Leu Thr 

290 295 300 

Tyr Glu Lys Glu Ala Glu Leu Met Gin Ser His Met Met Asp Gin Ala 
305 310 315 320 

He Asn Asn Ala He Thr Tyr Leu Gly Ala Glu Ala Leu His Pro Leu 

325 330 335 

Met Gin His Pro Pro Ser Thr He Ala Glu Val Ala Pro Val He Ser 

340 345 350 

Ser Ala Tyr Ser Gin Val Tyr His Pro Asn Arg He Glu Arg Pro He 

355 360 365 

Ser Arg Glu Thr Ala Asp Ser His Glu Asn Asn Met Asp Gly Pro He 

370 375 380 

Ser Leu He Arg Pro Lys Ser Arg Pro Gin Glu Arg Glu Ala Ser Pro 
385 390 395 400 

Ser Asn Ser Cys Leu Asp Ser Thr Asp Ser Glu Ser Ser His Asp Asp 

405 410 415 

His Gin Ser Tyr Gin Gly His Pro Ala Leu Asn Pro Lys Arg Lys Gin 

420 425 430 

Ser Pro Ala Tyr Met Lys Glu Asp Val Lys Ala Leu Asp Thr Thr Lys 

435 440 445 

Ala Pro Lys Gly Ser Leu Lys Asp He Tyr Lys Val Phe Asn Gly Glu 

450 455 460 

Gly Glu Gin He Arg Ala Phe Lys Cys Glu His Cys Arg Val Leu Phe 
465 470 475 480 

Leu Asp His Val Met Tyr Thr He His Met Gly Cys His Gly Tyr Arg 

485 490 495 

Asp Pro Leu Glu Cys Asn He Cys Gly Tyr Arg Ser Gin Asp Arg Tyr 

500 505 510 

Glu Phe Ser Ser His He Val Arg Gly Glu His Thr Phe His 

515 520 525 



<210> 7 
<211> 517 
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<212> PRT 

<213> Mus musculus 



<400> 7 

Met Asp Val Asp Glu Gly Gin Asp Met Ser Gin Val Ser Gly Lys Glu 

15 10 15 

Ser Pro Pro Val Ser Asp Thr Pro Asp Glu Gly Asp Glu Pro Met Pro 

20 25 30 

Val Pro Glu Asp Leu Ser Thr Thr Ser Gly Ala Gin Gin Asn Ser Lys 

35 40 45 

Ser Asp Arg Gly Met Gly Ser Asn Val Lys Val Glu Thr Gin Ser Asp 

50 55 60 

Glu Glu Asn Gly Arg Ala Cys Glu Met Asn Gly Glu Glu Cys Ala Glu 
65 70 75 80 

Asp Leu Arg Met Leu Asp Ala Ser Gly Glu Lys Met Asn Gly Ser His 

85 90 95 

Arg Asp Gin Gly Ser Ser Ala Leu Ser Gly Val Gly Gly lie Arg Leu 

100 105 HO 

Pro Asn Gly Lys Leu Lys Cys Asp He Cys Gly He Val Cys He Gly 

115 120 125 

Pro Asn Val Leu Met Val His Lys Arg Ser His Thr Glu Arg Pro Phe 

130 135 140 

Gin Cys Asn Gin Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu Leu 
145 150 155 150 

Ara His He Lys Leu His Ser Gly Glu Lys Pro Phe Lys Cys His Leu 

* 165 170 175 

Cys Asn Tyr Ala Cys Arg Arg Arg Asp Ala Leu Thr Gly His Leu Arg 

180 185 190 

Thr His Ser Val Gly Lys Pro His Lys Cys Gly Tyr Cys Gly Arg Ser 

195 200 205 

Tyr Lys Gin Arg Ser Ser Leu Glu Glu His Lys Glu Arg Cys His Asn 

210 215 220 

Tyr Leu Glu Ser Met Gly Leu Pro Gly Val Cys Pro Val He Lys Glu 
225 230 235 240 

Glu Thr Asn His Asn Glu Met Ala Glu Asp Leu Cys Lys He Gly Ala 

245 250 255 

Glu Arg Ser Leu Val Leu Asp Arg Leu Ala Ser Asn Val Ala Lys Arg 

260 265 270 

Lys Ser Ser Met Pro Gin Lys Phe Leu Gly Asp Lys Cys Leu Ser Asp 

275 280 285 

Met Pro Tyr Asp Ser Ala Asn Tyr Glu Lys Glu Asp Met Met Thr Ser 
290 295 300 

His Val Met Asp Gin Ala He Asn Asn Ala He Asn Tyr Leu Gly Ala 
305 310 315 320 

Glu Ser Leu Arg Pro Leu Val Gin Thr Pro Pro Gly Ser Ser Glu Val 

325 330 335 

Val Pro Val He Ser Ser Met Tyr Gin Leu His Lys Pro Pro Ser Asp 

340 345 350 

Gly Pro Pro Arg Ser Asn His Ser Ala Gin Asp Ala Val Asp Asn Leu 

355 360 365 

Leu Leu Leu Ser Lys Ala Lys Ser Val Ser Ser Glu Arg Glu Ala Ser 

370 375 380 

Pro Ser Asn Ser Cys Gin Asp Ser Thr Asp Thr Glu Ser Asn Ala Glu 
385 390 395 400 

Glu Gin Arg Ser Gly Leu He Tyr Leu Thr Asn His He Asn Pro His 

405 410 415 

Ala Arg Asn Gly Leu Ala Leu Lys Glu Glu Gin Arg Ala Tyr Glu Val 

420 425 430 

Leu Arg Ala Ala Ser Glu Asn Ser Gin Asp Ala Phe Arg Val Val Ser 

435 440 445 

Thr Ser Gly Glu Gin Leu Lys Val Tyr Lys Cys Glu His Cys Arg Val 
450 455 460 
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Leu Phe Leu Asp His Val Met Tyr Thr He His Met Gly Cys His Gly 
465 470 475 480 

Cys His Gly Phe Arg Asp Pro Phe Glu Cys Asn Met Cys Gly Tyr His 

485 490 495 

Ser Gin Asp Arg Tyr Glu Phe Ser Ser His He Thr Arg Gly Glu His 

500 505 510 

Arg Tyr His Leu Ser 
515 



<210> 8 

<211> 507 

<212> PRT 

<213> Mus musculus 

<400> 8 

Met Glu Asp He Gin Pro Thr Val Glu Leu Lys Ser Thr Glu Glu Gin 

15 10 15 

Pro Leu Pro Thr Glu Ser Pro Asp Ala Leu Asn Asp Tyr Ser Leu Pro 

20 25 30 

Lys Pro His Glu He Glu Asn Val Asp Ser Arg Glu Ala Pro Ala Asn 

35 40 45 

Glu Asp Glu Asp Ala Gly Glu Asp Ser Met Lys Val Lys Asp Glu Tyr 

50 55 60 

Ser Asp Arq Asp Glu Asn He Met Lys Pro Glu Pro Met Gly Asp Ala 
65 70 75 80 

Glu Glu Ser Glu Met Pro Tyr Ser Tyr Ala Arg Glu Tyr Ser Asp Tyr 

85 90 95 

Glu Ser He Lys Leu Glu Arg His Val Pro Tyr Asp Asn Ser Arg Pro 

100 105 HO 

Thr Ser Gly Lys Met Asn Cys Asp Val Cys Gly Leu Ser Cys He Ser 

115 120 125 

Phe Asn Val Leu Met Val His Lys Arg Ser His Thr Gly Glu Arg Pro 

130 135 140 

Phe Gin Cys Asn Gin Cys Gly Ala Ser Phe Thr Gin Lys Gly Asn Leu 
145 150 155 160 

Leu Arg His He Lys Leu His Thr Gly Glu Lys Pro Phe Lys Cys His 

165 170 175 

Leu Cys Asn Tyr Ala Cys Gin Arg Arg Asp Ala Leu Thr Gly His Leu 

180 185 190 

Arg Thr His Ser Val Glu Lys Pro Tyr Lys Cys Glu Phe Cys Gly Arg 

195 200 205 

Ser Tyr Lys Gin Arg Ser Ser Leu Glu Glu His Lys Glu Arg Cys Arg 

210 215 220 

Ala Phe Leu Gin Asn Pro Asp Leu Gly Asp Ala Ala Ser Val Glu Ala 
225 230 235 240 

Arg His He Lys Ala Glu Met Gly Ser Glu Arg Ala Leu Val Leu Asp 

245 250 255 

Arg Leu Ala Ser Asn Val Ala Lys Arg Lys Ser Ser Met Pro Gin Lys 

260 265 270 

Phe He Gly Glu Lys Arg His Cys Phe Asp Ala Asn Tyr Asn Pro Gly 

275 280 285 

Tyr Met Tyr Glu Lys Glu Asn Glu Met Met Gin Thr Arg Met Met Asp 

290 295 300 

Gin Ala He Asn Asn Ala He Ser Tyr Leu Gly Ala Glu Ala Phe Arg 
305 310 315 320 

Pro Leu Val Gin Thr Pro Pro Ala Pro Thr Ser Glu Met Val Pro Val 

325 330 335 

He Ser Ser Val Tyr Pro He Ala Leu Thr Arg Ala Asp Met Pro Met 

340 345 350 

Gly Ala Pro Gin Glu Met Glu Lys Lys Arg He Leu Leu Pro Glu Lys 
355 360 365 



-13- 



He 




Pro 


Ser 


Glu 


Aro 


Glv 


Leu 


Ser 


Pro Asn Asn Ser Ala Gin Asp 












375 






380 






Ser 


Thr 


ASP 


Thr 


ASP 


Ser 


Asn 


His 


Glu 


Asp Arg Gin His Leu 


Tyr 


Gin 








390 








395 




400 


Gin 


Ser 


His 


Val 


Val 
405 


Leu 


Pro 


Gin 


Ala 


Arg Asn Gly Met Pro 
410 


Leu 
415 


Leu 


jjy o 


Glu 


Val 


Pro 


Atq 


Ser 


Phe 


Glu 


Leu 


Leu Lys Pro Pro Pro 


He 


Cys 






420 








425 


430 






Leu. 

V ^4 


Arg 


Asp 


Ser 


He 


Lys 


Val 


He 


Asn 


Lys Glu Gly Glu Val 


Met 


Asp 




435 
t j j 










440 




445 






Val 


Phe 


Arg 


Cys 


Asp 


His 


Cys 


His 


Val 


Leu Phe Leu Asp Tyr 


Val 


Met 










455 






460 






Phe 


Thr 


He 


His 


Met 


Gly 


Cys 


His 


Gly 


Phe Arg Asp Pro Phe 


Glu 


Cys 


465 










470 








475 




480 


Asn 


Met 


Cys 


Gly 


Tyr 


Arg 


Ser 


His 


Asp 


Arg Tyr Glu Phe Ser 


Ser 


His 






485 










490 


495 




He 


Ala 


Arg 


Gly 


Glu 


His 


Arg 


Ala 


Met 


Leu Lys 










500 










505 









<210> 9 
<211> 17 
<212> DNA 

<213> Mus musculus 



<400> 9 

gggtgaaggc ctcaggt 17 

<210> 10 

<211> 33 

<212> DNA 

<213> Mus musculus 

<400> 10 

ccatcatatg agactgcatc agctccagcc tec 33 

<210> 11 

<211> 32 

<212> DNA 

<213> Mus musculus 

<400> 11 

ggaggctgag ctgatgeact ctcatatgat gg 32 

<210> 12 

<211> 30 

<212> DNA 

<213> Mus musculus 

<400> 12 

cacctacctt ggagctgagg cccttcaccc 30 

<210> 13 

<211> 21 

<212> DNA 

<213> Mus musculus 

<400> 13 

tggccctctg tggtgctcaa g 21 



<210> 14 
<211> 20 
<212> DNA 
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<213> Mus musculus 

<400> 14 
cacaggacta gaacacctgc 

<210> 15 

<211> 18 

<212> DNA 

<213> Mus musculus 

<400> 15 
gaacacgcca atatggcc 

<210> 16 

<211> 21 

<212> DNA 

<213> Mus musculus 

<400> 16 
ggccttggta gcatccaaag c 

<210> 17 

<211> 20 

<212> DNA 

<213> Mus musculus 



<400> 17 
agaatgtcag catggaggct 
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